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GENE DELIVERY FUSION PROTEINS 

Technical Field 

S The invention relates to the field of gene delivery, more specifically to proteins 

usefiil for introducing polynucleotides into target cells. Still more specifically, the 
invention relates to fusion proteins that are capable of both binding to a polynucleotide 
of interest, and of facilitating delivery of the bound polynucleotide to a target cell, 
especially to a mammalian target cell. 

10 

Background 

Many viruses have been adapted for use as gene delivery vectors for mammalian 
cells. Viruses have hig^ efficient mechanisms for entering cells, and in some cases 
also have specific mechanisms fcnr int^ratnig die viral gaxmie into die host cell 

IS diromosome. The high efBciency of gene tninsdncrion afforded by the viral vectors is 
the prir»pal advantage of using a virus-based system for gene delivery. In addition, 
the fact that the viruses are parfirailate allows virus-based systems to be considered for 
in vivo gene delivery. These attributes have led to the wide use of viral vectors in 
gene transfer studies. Viruses that have been used for this purpose include 

20 retroviruses, adenoviruses, parvovinises, papovaviruses, poxviruses and herpesviruses. 
More recendy, the utility of viral vectors has led to the use of retroviruses and 
adenoviruses in gene therapy plications. 

Although die virus-based delivery systems can give rise to hi^ efficiency of 
gene delivery, they suffer firom a number of disadvantages. For example, the most 

25 widely used viral system, the retroviral vectors, have been extensively modified to 

prevent die generation of rq>lication-conQ)etent retrovirus (RCR), but since such RCR 
. . has the potential to be Irakemogenic (see Donahue et al., J. Exp. Med. 176:1125-1135* 
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1992), all retroviral pieparatioiis for use in gene therapy must undergo extensive 
validation testing to confirm the absence of RCR before use. In addition to diese safety 
concerns, retroviral and other viral vectors can place size and sequence constraints on 
the genetic material that can be transferred and on the target cells that can be infected 
(see, e.g., Israel & Kaufman, Blood, 75:1074-1080, 1990; Shimotohno & Temin, 
Nature 299,265-268, 1982; Stead et al.. Blood, 71:742-747, 1988; and Bodine et al.. 
Blood, 82:1975-1980, 1993). 

The development of efficient non-viral gene delivery (NVGD) systems would 
aUow gene transfer/gene therapy studies to be performed in the absence of the 
aforementioned Ihnitations of die viral vectors, and could also have the advantages of 
ease of scalability, cost and speed of generation. Based on these advantages, non-viral 
gene delivery systems could also allow more diseases to be treated dirough gene 
dierapy making i]gectd>le gene delivery systems a reality. 

Existiiv non-viral gene delivery systems can be rtegUy divided into pl^^sical 
and biochemical approaches. The physical mediods include such techniques as 
electipporation, particle bombardment, scrape loading and calcium phosphate 
transfection (see, e.g., Fechheimer et al., P.N.A.S. 84:8463-8467, 1987; Cbeog et al., 
P.N.A.S. 90:4455-4459, 1993; and Kriegler, M . (ed.), "Gene Transfer and 
E9q)ressiQn, a Laboratory Manual," 1990, W.H. Freeman Publishers). The 
biochemical methods involve mixing the DNA to be delivered with reagents such as 
DEAE-dextran, gramicidin S, l4>osomes, polyamidoamine polymers, polyamines, 
polybrene, cationic proteins and pofy-L-lysine-based coi^ugates (see, e.g., Kawai & 
Nishizawa, Mol. Cell. Biol. 4:1172-1174, 1984; Behr et al., P.N.A.S. 86:6982-6986, 
1989; Rose et al., P.N.A.S. Biotecfaniques 10:520-525, 1991; Pardridge & Boado, 
F.E.B.S. Lett. 288:30-32, 1991; Legendre & Szoka, P.N.A.S. 90:893-897. 1993; 
Haensler & Szoka, Biocoig. Cbem. 4:372-379, 1993; and Wu and Wu, J. Biol. Chem. 
262:4429-4432, 1987). 

These different ^iproaches vary in their efficiency of gene delivery and in their 
ability to confer loqg-tmn (i.e. stable) retention of transferred sequences. However, 
tfse biodiemical qyproadies are in general more attractive from a gene dierapy point of 
view because sudi approaches have a greater potential for use widun injectable gene 
deliveiy systems than do most of die pl^sical s^iproadn. 
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One problem with the use of conjugates based on poIy-L-Iysine or other basic 
polymers, which are assembled via chemical cross-linking, is that the chemical stqps 
required for cross-linkiqg can be both unprecise and cumbersome. Moreover, it can be 
very difficult to control the stoichiometry of the different conjugate components in such 
5 a system, particularly as more components are added to facilitate gene delivery. 

SumnwY ftf ftg favCTtlOT 

In view of die continuing and unmet need for safe, efficient and stable non-viial 

gene delivery systems* the present invention provides a generalized approach for the 
10 ' modular construction of fusion proteins diat are citable of boA bnidiiq; to a 

polynucleotide of mterest, and of facilitating delivery of the bound polynucleotide to a 

target cell, especially to a human target cell for gene therqiy. 

The protems of the present invendon, teemed Gene Delivery Fusion Protrins 

(GDFPs) conqirise a nucleic acid bmdiBg domain (NBD) that contains a compoimi 
15 citable of bindhag die targeted nucleic acid; fused to a gene delivery domain (ODD) 

that contains one or more components that mjjrfj^fp or fi>?ilftffitff delivery of the targeted 

nucleic add to the target cell. 

As described in detail below, nucleic acid binding dcmiains can conqirise any of 

a number of CQnQX>nenls, the essential feature of iirtiich is fliat th^ are equable of 
20 binding nucleic acids. A number of such conqxments are known in the art (see; e.g., 

the references cited below), including proteins diat bind nucleic acids in a sequence- 

specific manner and proteins Oat bind nucleic acids relatively non-specifically. For 

purposes of discussion and illustration, nucleic acid binding domains can be 

conveniently groapcd into either of two basic subsets dependir^ on whedier tlie nudeic 
75 acid binding domain does or does not contain an analog of a sequence-specific nucleic 

acid binding protein, as described in more detail below. . 

In a first ^pe of gene delivery fiiskm protein of die present invention 

(sometimes refmed to herein as a *Type-I GDFP*), die nucleic acid bindipg domain 

contains an analog of a sequence-qiecific nucleic acid binding protein (sequenoe- 
30 specific NHP). b a steond type of gene delivery fusion protein of die present 

invention (sometimes referred to herem as a "Type-n GDFP*), the nudeic add buidiog 

doitnain contains an analog of a sequenoe-non-spedfic nudeic add bindii^ protein 
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(sequence-non-specific NBP) and does not contain an analog of a sequence-specific 
NBP. 

Thus, one embodiment of a GDFP of the present invention is a macromolecule 
useful in delivering a targeted nucleic acid to a taiget cell, comprising a gene delivery 
5 fusion protein (GDFP)» said GDFP conq>rising a nucleic acid binding domain (NBD) 
that contains a component capable of binding to a cognate recognition sequence in the 
taigeted nucleic acid which component is derived from a sequence-specific nucleic acid 
binding protein; fiised to a gene deliveiy domain (GDD) that contains one or mote 
components that mediate or facilitate deliveiy of the targeted nucleic acid to the target 

10 cell. In addition to the binding component derived from a sequence-specific NBP, the 
nucleic acid binding domain of Type-I GDFPs can also contain additional binduig 
components, as discussed below, which can be derived from eidier sequence-specific or 
sequence-non-specific nucleic acid binding pro t eins . 

Anodier enibodnnent of a GDFP of die present invention is a macromolecule 

15 useful in delivering a taigeted nuddc acid to a taigtt cell, consprising a gene deliveiy 
fusion protein (GDFP), said GDFP comprising a nucleic acid binding domain (NBD) 
that contains a conqx>nent cqiable of binding the taigeted nucleic acU ^liiidi 
conqxnmt is an analog of a sequence-ncm-specific nuddc acid binding protein; fiised 
to a gene deliveiy domain (GDD) that contains one or moie con^onenis diat mediate 

20 or facilitate ddiveiy of the taigeted imcleic add to the target cell. 

In one aspect of the invention, the components of the gene deliveiy domain 
(GDD) that facilitate deliveiy of the taigeted nucleic acid to the taiget cell are selected 
from flie group consisting of a binding/taigeting conqxment, a membrane-disrupting 
con^onoxt, a transpoit/localization component and a leplicon integration conqxment 

25 In another aspect of the invention, die various functional domains and conqxments of 
the GDFP are separated by flexible pq>tide linker sequences ("flexons"), which can 
enhance die ability of die con^nents to adopt confoimations relatively indqiendently 
of each other* 

Another embodiment of a GDFP of the invendon is a recombinant 
30 polynucleotide encoding a GDFP. In a preferred embodunem of diis 9pe, die 

polynudeotide is an expression vector and is arranged so that die various domains and 
con^nents of the GDFP are expressed as an in-frame ftision product, thereby allowing 



W 9SA28494 PCT/DS9SA>4738 



-5- 

for efficient modular synthesis of the GDFP as a single recombinant product. Yet 
another embodiment is a method of using the above-described recombinant 
polynucleotide to produce a GDFP, said method comprising the steps of causing the 
recombinant polynucleotide to be transcribed and/or translated and recovering a GDFP. 
S As discussed herein, the preferred mediod involves die modular synthesis of die GDFP 
as a single protein product. Yet another embodiment is a method of using a GDFP to 
deliver a targeted nucleic acid (tNA) to a target cell, die mediod conqnisiiig die steps 
of contacting die GDFP with the targeted nucleic acid to produce a GDFP/nucIeic acid 
conqilex and contacting said GDFP/nucleic acid complex with the target cell. 
10 Preferably, die tNA is an egression vector. Preferably, the target cell is a mammalian 
cell. Yet aiiodier einbodfanem is a ceU produced 1^ die above-described method <rf 
umig a GDFP and die progei^ thereof. 



IS Brief Description of die Drawings 

Figure 1 is a schematic representation of an embodiment of the Gene Delivery 
Fusion Protein (GDFP) concept usn^ a l^pe-I GDFP. 

Figures 2A and 2B are diagrams of die cloning strategy used to generate 
expression vectors encoding IL-2, GAL4, and die GAL4/IL-2 and IL-2/GAL4 GDFPs. 
20 Figure 3 is an SDS-PAGE gel of 3S-S labeled GAL4/IL-2m GDFP. 

Figure 4 is a gel-shift assay showing retention of DNA binding activi^ by the 
GALA/ILr2m GDFP. 

Figure 5 shows retention of IL-2 bioactivity by die GAI4/IL-2 GDFP. 
Figure 6 is an SDS-PAGE gd of 3S-S labeted GAL4/IL-2 GDFP and Qy- 
25 2/GAI>«GDFP. 

Figine 7 shows sequence-specific DNA bindii^ of die GAL4 protein and tbe IL- 
2/GAL4 atnd GAL4/II/-2 GDFPft. 

Rguve 8 shows the cytokme bioactivi^ of ibe IL^2/GAL4 and GAL4/ILr2 
GDFPft. 

30 Figure 9 shows the results of an asssQr demonstrating die ability of GDFPs to 

bind to ILp2 receptor-bearir^ CTLL. 
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Figure 10 shows the results of an assay demonstratiiq; the ability of GAL4/IL-2 
GDFP and IL-2/GAL4 GDFP to mediate bindmg of a target oligomer to IL-2 
receptor-bearixig CTLL. 

Figure 11 shows the results of an assay demonstratiqg the abili^ of GAL4/IL-2 
5 GDFP to mediate binding of a taxget plasmid to IL-2 leceptor-bearing CTLL. 

Detailed Descriptjon of the Invention 
The invention provides a non-viral gene deliveiy system by whidi DNA, RNA 
and/or analogs thereof ("targeted nucleic acid* or "tNA" to be used in gene deliveiy) 
10 are modified by association with a gene delivery fiision protein (GDFP)- The non-viral 
gene delivery system of the present invention conqirises a macromolecular comqilex of 
two sq)arate entities: the targeted nucleic acid to be delivered* and a GDFP. The 
GDFP comprises a nucleic acid bindiqg domain (NBD) that can bind to the targeted 
nucleic acid and thus lead to the fonnation of a GDFP/tNA conq>lex; ftased to a gene 
15 delivery domain (GDD) that can mediate or facilitate the deliveiy of die GDFP/tNA 
complex into the target cells. 

In a preferred embodiment of ttie ijavention die open reading frames gf^wiit^ 
the various GDFP domains and components are fused to enable esqnression of die 
GDFP as a six^e polypeptide. However, the GDFP may also conqirise, for example, 
20 one or more short flexible peptide linker sequence ('flexons") between the individual 
domains and/or components. 

GciWfflPgfinWWB 

The terms "polypeptide", "peptide" and "protein" are used interchapgeably to 
25 refer to polymers of amino acids and do not refier to any particular lengths of the 
polymers. These terms also include post-translationally modified proteins, for 
example, glycosylated, acetylated, phosphoiylated proteins and the likel Also included 
within the definition are, for example, proteins containing one or more analogs of an 
amino acid (including, for exanQ>Ie, unnatural amino acids, etc.), proteins widi 
30. . substituted linkages, as weU as other modifications known in the art, both naturally 
occurring and non-natural]^ occurring. 
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"Native" polypeptides or polynucleotides tcfer to polypeptides or 
polynucleotides recovered from a source occurring in nature. Thus, the phrase "native 
viral binding proteins" would refer to naturally occurriiig viral bindiog proteins. 
"Mutein" forms of a protein or polypeptide are those which have minor 
5 alterations in amino acid sequence caused, for example, by site-specific mutagenesis or 
other manipulations; by errors in transcription or translation; or which are prepared 
synthetical^ 1^ rational design. Minor alterations are those which result in amino acid 
sequences wherein the biological activity of die polypq>tide is retained and/or wherein 
the mutein polypqitide has at least 90% homology with the native form. 

10 An "analog" of a polypqitide X includes firagmenis and mntglng of pofypq)tide 

X that retam a particular biological activity; as wdl as polypq»tkle X diat has bem 
incorporated into a larger molecule (odier flian a molecule widiin which it is normally 
found); as well as synthetic analogs diat have been prepared by rational design. Far 
example, an analog of a DNA binding protein wigfst refer to a portion of a native DNA 

IS bindii^ protein that retains the ability to bind to DNA, to a mutein dneof, to an entire 
native bindir^ protein that has been incorporated into a reconibinaiit fusion protein, or 
to an analog of a native binding protein tiiat has been syndieticalty pr q iar ed by rational 

"Polynucleotide* refers to a polymeric form of nucleotides of any ler^gtii, eitinr 
20 ribonucleotides or deoxyribonucleotides, or analogs tiiereof. This term refers only to 

the primary structure of the molecule. Thus, double- and single*stranded DNA, as well 
as double- and single- stranded RNA are included. It also includes ^oHififrf 
potymicleotides such as methylated or capped polynucleotides. 

An "analog" of DNA, RNA or a polynucleotide, refers to a macramolecule 
.23 resembling naturally-occurring polynucleotides in form and/or function (particular^ in 
the ability to eoigage in sequence-specific hydrc^en bonding to base pairs on a 
conq)lementary poljmucleotide sequence) but whidi differs from DNA or RNA iii, for 
exaiiq)le, the possession of an unuaial or ixm-riatural base or m A 
large variety of such tnolecules have been described for use in anrisense tedmology; 
30 see, e.g., E. Uhhnaim et al. (1990) Chemical Reviews 90:543-384, and the 
publications reviewed tiierein* 
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An "antisense" copy of a particular polynucleotide refers to con4)lenientary 
sequence that is capable of hydrogen bonding to the polynucleotide and may» therefore, 
be capable of modulating expression of the polynucleotide (i.e. by "antisense" 
regulation). Such an antisense copy may be DNA, RNA or analogs thereof, including 
S analogs having altered backbones, as described above. The polynucleotide to which the 
antisense copy binds may be in single-stranded form (such as an mRNA molecule) or in 
double-stranded form (such as a portion of a chromosome). 

A "replicon" refers to a polynucleotide conqirising an origin of replication 
(generally referred to as an sd sequence) which allows for rq>lication of the 

10 polynucleotide in an appropriate host cell. Exanqiles include rq>licons of a target ceU 
into which a desired nucleic acid might integrate (in partkular* nuclear and 
mitochondrial chromoscnnes; and also extradiromosomal rqilicons such as plasmids). 

"Recombinant," as aiq>lied to a polynucleotide, means that llie polynucleotide is 
the product of various combinations of cloniqg, restriction and/or ligation stqss 

IS resulting in a constract that is distinct fixim a polynucleotide found in na 

"Recombinant" may also be used to refer to tiie protein product of a recombinant 
polynucleotide. Typically, DNA sequences encoding tiie structural codii^ sequrace 
for, e.g., components of the NBD and GDD, can be assembled from cDNA fragments 
and short oligonucleotide linkers, or firom a series of oligonucleotides, to provide a 

20 synOetic gene which is capable of beiqg esqpressed when operably linked to a 

transcriptional regulatoiy region. Such sequences are preferably provided in the form 
of an open reading firame unintemqyted by internal non-translated sequences (i.e. 
"introns"), such as those commonly found in rakaiyotic genes. Such sequences, and 
all of the sequence referred to in die context of die present invention, can also be 

25 generally obtained by PCR amplification using viral, prokaryotic or eukaryotic DNA or 
RNA templates in conjunction with appropriate PCR anq>lin>ers. 

A "recombinant expression vector" refers to a polynucleotide which contains a 
transcrq>tional regulatoiy region and coding sequences necessary for die e3q)ression of 
an RNA molecule and/or protein and wlash is capable of beiqg introduced into ai target 

30 • cell (by, e.g., viral infection, transfection, electrop or ation or by die non-viral gene 

delivery (NVGD) tediniques of die present inventioiO* A fiirdier exanqile would be an 
e3q)res8ion vector used to express a GDFP of the present invention. 
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"Recombinant host cells", "host cells", "cells", "taiget cells", "cell lines", "cell 
cultures", and other such tenns denote higher eukaiyotic cells, most preferably 
mammalian cells, which can be, or have been, used as recipients for lecombinant 
vectors or other transfer polynucleotides, and include the progeny of die original cell 
5 which has been transduced. It is understood diat the progeigr of a single cell may not 
necessarily be completely identical (in morphology or in genomic or total DNA 
conqplement) to the original parent cell, due to natural, accidntal, or deliberate 
mutation* 

An "open readiqg frame" (or "ORF") is a region of a polynucleotide sequence 

10 that can encode a polypeptide or a portion of a polypeptide (i.e., die region may 

represent a portion of a protem codiiig sequence or an entire protein coding sequence), 

"Fused" or "fiision" refers to die joining tog^her of two or more elements, 
conqxments, etc., by whatever means (inchidimg, for exanqple, a "fiision protein" made 
by diemical conjugation (whether covalem or non-covalent), as well as the use of an 

15 in-fiame fusion to generate a "fusion protein" by recombinant means, as 

infta). An "in-frame fusion" refers to die joining of two or more open readiqg 

frames (ORFs), by recombinant means, to form a single larger ORF, in a manm that 
maintains the correct reading frame of die original ORFs. Thus, the resultiog 
recombinant fusion protein is a single protein containing two or more segments ttmt 

20 correspond to polypeptides encoded by the original ORFs (which segments are not 
normally so joined in nature). Although the reading frame is thus made continuous 
throughout the fused segments, die segments may be physicaUy squuated by, for 
exanq>le, in-frame flexible polypeptide linker sequences ("flexons"), as described infra. 
A "flexon" refers to a flexible polypeptide linto sequence (or to a iniclei^ 

25 sequence encoding such a polypeptide) iKiiich ^ically conqirises amino acids haviog 
small side dudos (e,g., glycine, alanine, valine, leudne, isoleudne and serine). In the 
present invention, flexons can be incorporatBd in die ODFP between one or more of die 
various domains and components. Incorporating flexons b et we en these components is 
believed to promote funcdonality by allowing ihem to adopt c onfor matians relatively . 

30 independently from each odier. Most of the amino acids inc o rpor a ted into the flexon 
will preferabty be amino acids havii^ small side chains. The flexoii will prefiBtabfy 
comprise between about four and one hundred ammo acids, more prefoably between 
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about eight and fiffy amino acids, and most piefeiably between about ten and thirty 
amino acids. Flexon ("Pixy") sequences described in U.S. Patents 5,073,627 and 
5,108,910 will also be suitable for use as flexons. 

A "uranscriptional regulatoty region" or "transcriptional control region" refiers to 
5 a polynucleotide encompassiiig all of the cis-acting sequences necessary for 
transcription, and may include sequences neoessaiy for regulation. Urns, a 
transcriptional regulatoty region includes at least a promoter sequence, and may also 
include other regulatoty sequences such as enhancers, transcription factor binding sites, 
polyadenylation signals and spUcii^ signals. 
10 "Operably linked" refers to a juxtq)osition wherein the components so described 

are in a relationship pennittiiig diem to function in Aeir intended manner. For 
instance, a promoter sequence is operabl^y linked to a codiqg sequence if the pramoier 
sequence promotes transcrq)tion of the coding sequence. 

"Transdiiction," as used herein, refers to the introduction of an exogenous 
IS polynucleotide into a host cell, irrespective of the metfiod used for the insertion, which 
methods include, for exanq>le, transfection, viral infection, transfonnation, 
electroporation axid the non->viral gene delivny techniques of the present invention. 
The introduced polynucleotide may be stabfy or transiently maintained in the host cell. 
Stable maintenance ^ically requires Oat die introduced pofynudeotide either contains 
20 an origin of replication compatible with the host cell or integrates into a replicon of the 
host cell such as an extrachromosomal rq>licon (e.g. a plasmid) or a nuclear or 
mitochondrial chnmiosoiitt. 

"Retroviruses" are a class of viruses which use RNA-directed DNA polymerase, 
or reverse transcriptase, to rqiUcate a viral RNA genome resulting in a double-stranded 
25 DNA intermediate which is in c oip or ated into chromosomal DNA of aii avian or 

mammalian host cell. Many such rttroviruses are known to diose skilled fai die art and 
aredescribed,forexamDle. m Weiss etal.>eds. RNA Tumor Viruses. ^ 
Spriiv Harbor, New Yoik (1984 and 1985). Plasmids containiog retrovtaal genomes 
are also widely available, from die American Type Culture Collecticni (ATOC) and. 
30 other sources. The nucleic acid sequences of a large number of these viruses are 

known and are generaUy available, for exanq>le, fitom databases such as GENBANK. 
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A "sequence-specific nucleic acid bindipg protein" is a protein diat binds to 
nucleic acids in a sequence-specific mannert i.e., a protein that binds to certain nucleic 
acid sequences (i.e. "cognate recognition sequences", infira) with greater aifini^ than to 
other nucleic acid sequences. A "sequence-non-specific nucleic acid binding protein" is 
a protein that buids to nucleic acids in a sequence-non-specific manner, i.e. a protein 
that binds generally to nucleic adds. 

A "cognate" receptor of a given ligand refers to the receptor normally cspMc 
of binding sudi a ligand. A "cognate" recognition sequence is defined as a nucleotide 
sequence to which a nucleic acid binding domain of a sequence-specific nucleic acid 
binding protein binds wiA greater afiBnity tfian to other nucleic acid sequences^ A 
"cognate" interaction refers to an intennolecular association based on such types of 
bindiqg (e.g. an association between a receptor and its cognate ligand, and an 
association between a sequence-specific nucleic acid binding protein and its cognate 
nucleic add sequence). 

"Gene delivety" is defined as die introduction of targeted nucleic add into a 
target cell for gene transfer aiKl may encompass targetingAiinding, iqstake, 
transport/localization, replicon integration and e3q)ression. 

"Lyn:q)hocytes' as used herein, are spherical cells widi a large round nucleus 
(which may be indented) and scanty cytoplasm. Th^ are cells Oat specificaUy 
recognize and respond to non-self antigens, and are responsible for development of 
specific immunity. Included widiin "lymphocytes" are B-lympho^tes and T- 
lymplio^tes of various classes. 

"Lymphohematopoietic stem cells" are cells which are ^ically obtained finnn 
the bone marrow or perq)heral blood and which are capable of giving rise, diroughcell 
division, to ai^ mature cells of die lyiiq>hoid or hematopoi^ systems.. This term 
includes committed progenitor cells widi significant dumgh limited ciqpad^ for self- 
renewal, as well as die more primidve cells sudi as those capable of fbmung spleen 
colonies in a CFU-S assay, and still more primitive cells possessing loqg-teim and/or 
multilineage re-populating ability in a transplanted mammalian host* 

"Lymphohematopoietic cells" include die various mature cells of die ^ympiioid . 
or hematopoietic systems (including lymphocytes and other blood cells), as well as 
lymphohematopoietic stem cells. 
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A "primaiy culniie of cells" or "primaiy cells" refer to cells which have been 
derived duectly from in vivo tissue and not extensively passaged. Primaiy cultures can 
be distinguished from cell lines and established cultures principally by the retention of a 
karyotype which is substantially identical to the kaiyo^e found in the tissue from 
5 which the culture was derived* and by the cellular responses to manipulations of the 
envux)nment which are substantially similar to the Jn vivo cellular responses. 

As is described in detail below, the non-viral gene deliveiy conq)lexes of the 
present invention comprise gene delivery fusion proteins (GDFPs) that bind targeted 
nucleic acid through a nucleic acid bmding domain (NBD) and facilitate gene delivery 
10 through a gene deliveiy domain (GDD). Each of these domains can conq>rise a 
number of different functional components and sub-components. Some of these 
potential components are summarized in the following list: 

NQN-mA]U gENE PPI.yvERY COMP^EK (the "gpgP/tWA Compter 

15 

1. Gene Peli verv Fusion Prtitein rGDFP) 

A. Nucleic Acid Binding Domain (NBD) 

(1) Nucleic Acid Binding (NB) conqxment 

(2) Other possible conqx>neiits (e.g. mediating conqnession of tNA) 
20 B. Gene Deliveiy Domain (GDD) 

(1) Binding/Targeting (B/T) component 

(2) Membrane-Disnq)ting (M-D) component 

(3) Transport/Localization (T/L) component 

(4) Replicon Integration (RI) component 

25 

2. Targeted Niidric Add tftNA^ 

A. BindiiQ shes fiir Oe GDFP (see infia) 

B. Sequence of interest (e.g. gene to be delivered) 

C. Other possible sequences (e.g. selectable markers) 

30 

Each of Aese domains and components, as well as additional elements that may 
be inchidf^, are defined and described m detail bdow. 
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The practice of (he present inventioii will employ, unless otherwise indicated, a 
number of conventional techniques of molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of Oe art. Such techniques are 
explained fully in the literature, see, e.g., Kriegler, M. (ed.), "Gene Transfer and 
S Esqiression, a Laboratory Manual," (1990), W.H. Freeman Publishers; Sambrook, 
Fritsch, and Maniatis, "Molecular Cloniqg: A Laboratory Manual," Second Edition 
(1989); P.M. Ausubel et al. (eds.), "Current Protocols in Molecular Biolpgy," (1987 
and 1993); M.J. Gait (ed.), "Oligonucleotide Synthesis," (1984); ILL Freshn^ (ed.), 
"Animal Cell Culture," (1987); J.M. Miller and M.P. Calos (eds.), "Gene Transfer 

10 Vectors for Mammalian C^lls," (1987); D.M. Weir and C.C. BlackweU (eds.), 
"Handbook of Bqierimental Immunology;" J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach and W. Strober, (eds.)t "Current Protocols in Immunology," 
(1991); and the series entided "Mediods in Enqmoolpgy," (Academic Press, Inc.). AU 
patents, patent aj^licadons, and publications moitioDed hneiii, bodi supra and in&a, 

15 are hereby incorporated herein 1^ reference. 

Illustrations of Tvne-I Gene Ddivgrv Fusion Protciiig 

The Gene Delivery Fusion Protein / Targeted nucleic acid Comple x (GDFP/tNA> 
20 One concq»t of the present invendon is to create recombinant gene delivery 

fusion proteins (GDFPs) that are able to bind to a cognate recognition sequence in a 
targeted nucleic acid (tNA) and facilitate delivery of die tNA into a target cell. The 
GDFPs bind targeted nucleic acid through a nucleic acid bindirig domain (NBD) and 
facilitate gene delivery duougji a gene delivery domain (GDD). 
25 Thus, in the context of the present invention, targeted nucleic acids can be 

delivered via one or more st^ that are mediated or augmented by GDFPs. In 
particular, die gene delivery process can inchide one or more of the foUowiqg steps: 

(1) binding and/or targetipg of the GDFP/tff A complex to the surfeoe of a target cell; 

(2) Intake of the OfA (widi or widiout die GDFP) by die target cdl; 0) intraceUular 
30 transport and/or localization of the tNA to an organelle such as a nucleus or 

mitochondrion; atKi (4) integration of die tNA into a celtular replicon such a 
diromosoime. A particular GDFP need not tKcessarQy perform all of ttiese functions. 
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For exaiiq)le, a GDFP intended to deliver an e>q>iession vector to the nucleus of a cell 
could be constructed to contain: (i) an NBD capable of binding to a cognate recognition 
sequence on the expression vector and; (ii) a GDD havipg only a transport/localization 
component such as a nuclear localization sequence. Such a GDFP could then be 
5 conq)lexed with targeted nucleic acid and introduced into target cells by a transducticm 
method such as electroporation. The GDFP would dien fi>cifitate tianapod/localization 
to the nucleus^ periiaps to a specific site in a r^licon, and thus enhance expiession of 
the vector. Alternatively, for exanq)le, the aforementioned GDD could be modified to 
include a bindiiig/targetipg component and a membrane-disniptipg ccmqponent. Usiqg 

10 such a GDD, the GDFP/tNA conqilex could be directed to a particular cell type widiin 
a population of cells, and uptake of the comploc could proceed widunit the need f<nr, 
e.g., electroporation. Use of the GDFPs in conjunction widi techniques sudi as 
electroporation, as in the former example^ would of course be more appropriate for in 
vitro gene delivery. Use of GDFPs as described in die latter exanq)le could be readily 

IS applied to the delivery of genes eiAer in vitro or in vivo. Similarly, the GDFP/tNA 
complexes could be used as admixtures with odier proteins or simple chemicals tl«^t 
enhance gene delivery. This could include, for exaiiq)le, enhancing the intake of 
GDFP/tNA complexes by adding membrane disrupting agents m trans. 

Other combinations of components can be prepared (and particular versions of 

20 the components can be selected) according to die specific design objectives of die gene 
delivery scheme. These objectives include, for exaoqile, the location of the cells to be 
targeted, the desired cellular specificity of targeting, and the desired sulxellular 
destinatkm of die fl^IA. 

The individual doinaiiis aiid conqxments of the GDFP/tNA con^le^ 

25 constraction and assembly are described in more deadl bdow. . 

t. The Gere Deliverv Fusion Protein <GDFP\ 

The GDFP cosqnises two major domains, a nucleic .acid bindii^ domain (NBD) 
and a gene delivery domain (ODD). Each of these major domains conqxrises one or 
30 . more conqxments fiicilitadng nucleic acid binding and gene delivery, respectively. 

These individual components may be derived from daturalfy-occurriqg proteins, or tb^ 
inay be syndietic (e.g. an analog of a naturaUy-oocurriog ccnnponent). Typical^, 
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cloned DNA encoding various coQ^x)nents will already be available as plasmids - 
although it is also possible to synthesize polynucleotides encoding the components 
based upon published sequence information. Polynucleotides encoding the conqxments 
can also be readily obtained using polymerase chain reaction (PCR) mediodology, as 
5 described, for example, by MuUis and Faloona (19S7) Metfa. Enqrmology 1S5:33S. 
In the construction of the GDFP, discussed in more detail below, DNA 
sequences encoding the domains and their various components are preferably fused 
in-firame so that the GDFP can be conveniently synthesized as a single polypeptide 
chain (i*e« not requiring fturther assenibly). The various domams and components can 
10 also be separated flexible peptide Unker sequences called "flexons" whicb are 
defined in more detail above. 

A. The Nucleic Acid Binding nm«5i«i r^^p) 

A nucleic acid binding domain is a lei^gib of polypeptide capable of bindiqg 

IS (either durectly or indirectly) to die targeted nucleic acid (Of A) widi an affinity 

adequate to allow the gene delivery domain of the GDFP to mediate or angmgnt the 
deliveiy of the tNA into a target cell. Most conveniently, the NBD will bind direct^ 
to the tNA without the need for any imermediaiy bindiog eionenL 

hi l^re-I GDFPs, the NBD contains a sequence-specific bindipg conqxment diat 

20 is an analog of a sequence-specific nucleic acid bmding prcMein. Inonepxefemd 

embodiment of this type, the component allows the nucleic acid bindii^ by the NBD to 
be sequence-specific with respect to the tNA, in which case the NBD may bind to a 
specific cognate recognition sequence widiin the tNA; &s is illustrated in Figure 1. 

As described herein, one particular advantage of the Type I GDFP q)proach is 

25. that it not only allows the stoichiom^ric attachment of delivery c om p on e n ts to die tNA, 
but also allows ^ GDFP to be positioned at pre-detezmined locations widi respect to 
the tNA. For exanq)le, die positioning of NBD cognate recognition sequences in 
proximhy to terminal integrase recognition sequences can fiicilitaie die use of GDFPft to 
mediate integration, as desoibed below.. 

30 . . The NBD may comprise* for example, a known nucleic acid bindiqg protein, or 
a nucleic acid bindmg r^ion thereof • The NBD may also comprise two or more 
nucleic acid bindiqg regions derived firom die same or different nucleic acid binding 
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proteins. Such multimerization of nucleic acid binding regions in the NBD can allow 
for the mtexaction of the GDFP with the targeted nucleic acid to be of desirable 
specificiQr and/or higher afifini^. This strategy can be used alone or in combination 
with multimerization of recognition sequence motifis in the tNA to increase binding 
S avidi^, as discussed below. 

DNA encoding the NBD domain of the GDFP may be obtained from mar^ 
different sources. For example, many proteins that are capable of binding luicleic acid 
have been molecularly cloned and their cognate target recognition sequences have been 
identified (see, e.g., MitcheU & Tjian, Science 245:371-378, 1989; Pabo St Sauer, 

10 Ann. Rev. Biochem. 61:1053-1095, 1992; Harrison, S.C., Nature 353:715-719, 1991; 
Johnson & McKnight, Arm. Rev. Biochem. 58:799-839, 1989; and references reviewed 
therein, hereby incorporated by reference). Such sequence-specific binding proteins 
include, for example, regulatory proteins such as those involved in transcription or 
nucleic acid replication, and typically have a modular construction, consisting of 

15 distirx:t DNA binding domains and regulatory domains (see,, e.g., Struhl, Cell 49:295- 
297, 1987; Frankel and Kim, CeU 65:717-719, 1991; and Pabo & Sauer, Ann. Rev. 
Bipchem. 61:1053-1095, 1992; and references reviewed therein, hereby incorporated 
by reference). A number of families of such nucleic acid binding proteins have been 
characterized on the basis of recurring structural motift ir^luding, for exanqile, 

20 Helix-Tum-HeUx proteins such as the bacteriophage lambda d r e pr e sso r ; 

Homeodomain proteins such as die T>msnphiia Antermapedia regulator; the POU 
domain present in proteins such as die mammalian transcripdon fiictor Oct2; Zinc 
finger proteins (e.g. GAL4); steroid receptors; leucme rqiper proteins (e.g. GCN4, 
C/EBP and c-jun); beta-sheet modfiB (e.g. the pndcatyotic Arc r e pr essor); and odier 

25 ftmilies (inchiding serum response fiictor, oncogenes such as cnnyb, NFkB and vel, 
and odiefs); see, e.g., Pabo & Sauer, Aim. Rev. Biochem. 61:1053-1095, 1992, and 
references reviewed dierein, herAy incorporated by r e f e r e n ce. 

For many of these protehis, the nucleic acid biiKliqg domains have been mapped 
m detail; and, for a riumber of sudi doniains, recombinaiit fusions 

30 sequences have been made arid shown to rmin die binding acdvities of the parei^ 
DNA bmdiog domauL For example, in die case of die yeast-derived trans cr i p t iona l 
activator GAL4, the DNA binding domain has been defined, and fusions of diis dinnain 
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to heterologous adjoining sequences have been maide that retain DNA sequence-specific 
bindiog activiQr (Keegan et al.. Science 231:669-704, 1986; Ma Jk Ptashne, Cell 
48:847-853, 1987). This ability to functionally "sw^>" binding domains has also been 
shown for a number of other DNA binding proteins, including, for example, the E, ooli 
5 lex A repressor (Brent and Ptashne, Cell 43:729-736, 1985), the yeast transcrqytional 
activator GCN4 (Hope and Struhl, Cell 46:885-894, 1986), the bacteriophage lambda 
cl repressor (Hu et al.. Science 250:1400-1403, 1990), the mammalian transcrq>tion 
factors Spl (Kadonaga et al., CeU 51:1079-1090, 1987) and C/EBP (Agre et al.. 
Science 246:S)22-926, 1989). Similariy, fimctional swapping has been rqx>rted in the 

10 nuclear DNA-binding steroid hormone recq)tors (see, e.g.. Green and Chandxm, 
Nature 325:75-78, 1987). See also, e.g., Khig & Rhodes, Trends Biochem. Sci. 
12:464-471, 1987; Berg, Cell 57:1065-1068, 1989; Wasylyk et al.. Ear. J. Biochem. 
211:7t18, 1993; Faisst & Mqrer, Nud. Acids Res. 20:3-26, 1992; Stnihl, Trends 
Biochem. Sci. 14:137-140, 1989; and Nelson A Saner, CeU 42:549-558. 1985. 

15 Sequence-specific nucleic acid bindiqg proteins can exhibit a range of bindiiig yfBnifieff 
to different cognate nucleic acid sequences in vitro (see, e.g., Vashee et al., J. 
BioLChem 268:24699-24706, 1993). 

Virally encoded nucleic acid binding proteins can also be used in the present 
invention. These include, for exanq>le, the adenovirus E2A gene product, which can 

20 bind single-stranded DNA, double-stranded DNA and also RNA (Oeghon «t al.. 

Virology 197:564-575, 1993, and references cited therein); die retroviral IN proteins 
(Krpgstad & Champoux, J. ^^1 64: 2796, 1990); the AAV 68 and 78 protems 
(Owens et al., J. Vkol 67: 997, 1993); and die SV40 T antigen (Arthur al.« J. 
Virol., 62:1999-2006, 1988). The ceUular p53 gene product, which binds T antigen^ is 

25 also a DNA bmdmg protein (FUnk et al., Mol. Cell Biol., 12:2866-2871, 1992). 

Similarly, RNA bindipg proteins have been identified and Aeir inclusion in the 
NBD would associate Oe GDFP widi a targeted RNA and thereby achieve RNA 
deliveiy mediated by the gene deliver domam pf die GDFP* RNA binding proteins 
duit can be used in the context of the presem invention include, for exampfe^ 

30 , and Rev proteins of HIV; see, e.g., Til^ et al., P.N.A.S. 89:758-762, 1992; and 

CuUen et al.. Cell 73:417^20, 1993. Shnilaily, ceDular RNA bindiqg protems, such 
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as the interferon-inducibte 9-27 gene product (Constantoulakis et al.. Science 259:1314- 

1318. 1993), can also be used. 

Nucleic acid bindii^ domains of Type-I GDFPs can also contain (in addition to 

a component derived from a sequence-specific nucleic acid binding protein) one or ' 
5 more components that are derived from sequence-non-specific nucleic acid bindli^ 

proteins. Such sequence-non-specific binding proteins include, for exanq>le, hi-grftTigg 

(von Holt, Bioassays 3:120-124, 1986; Rhodes, Nucleic Acids Res. 6:1805-1816. 

1979; Rodriguez et al., Biqphys. Chem. 39:145-152, 1991); proteins such as nucleolin 

(Erard et al., Eur. J. Biochem. 191:19-26, 1990); polybasic polypqitide sequences such 
10 - as pofy-Lrlysine (U et al.. Biochemistry. 12:1763-1772 1973; Weidcx^f and Li, 

Biopolymers 16:669-684, 1977). avidin (Panbidge & Boado. F.E.B.S. Lett. 288:30-32. 

1991); the non-histone high mobili^ group proteins and other protems (see. e.g.. Pabo 

& Sauer. Ann. Rev. Biochem. 61:1053-1095. 1992, and references reviewed dierein); 

that interact non-specifically widi nucleic acids. Ottier proteins binding nucleic acid in 
15 a sequence-non-specific fashion include retroviral nucleocapsid (NQ proteins <see. 

e.g.. Gel£md et al.. J. Biol. C3iem.. 268:18450-18456. 1993). 

Thf G^pe P^Hv^ry Domam (QPP) 

The GDD portion of the GDFP contains one or more polypeptide regions duit 
20 mediate or augment the efficient of gene delivety. Such sequences may include, for 
exanqple, binding/targeting conqxments. membrane-disruptlqg components, 
transport/localization conqKments, and rq>licon integration coaqx>nent8. as discussed 
below. 

A particular GDD need not contain a component rqnesentiQg each of the 
25 aforementioned types. . Conversely, a GDD may contain more dum a single component 
of a given type to obtain the desired activi^. Moreover, a particular segment of a 
GDDi niigfat serve die function of two or more of tliese co m pone n ts. F6r exanqple. a 
sir^ region of a polypeptide might fonction bodi in bindii« to a cell surfisoe and in 
disruption of die menibrane at that surfine. 



30 
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m Binding/Targeting ffim Components 

Binding/targetuig components are regions of polypeptides that mediate bindix^ 
to cellular surfaces (which bindii^ may be specific or non-specific* direct or indirect). 
Any protein that can bind to the surface of the desired target cell can be employed as a 
5 source of B/T components. Such proteins include, for example, Uganda such as 

cytokmes that bind to particular cell surftoe receptors, antibodies, lectins, viral binding 
proteins, cellular adhesion molecules, and any other proteins that associate with cellular 
surfaces. The "receptors" for these binding proteins include but are not limited to 
proteins. Moreover, fhe receptors may, but need not, be specific and/or restricted to 

10 certain cell types. Essentially, the B/T conq)onents can be prqiared from ai^ l^gand 
that binds to a ceU surface molecule. 

By way of illustration, one group of proteins from which the B/T components 
can be derived are cytokines. Cytokines are intercellular signallmg molecules, the best 
known of which are involved in die regulation of mammalian somatic cells. Several 

15 families of cytokines, both growtti promoting and growth inhibitory in their effiscts, 
have been characterized. Thus, a B/T component can campiisc an amino acid 
sequence containing at least that portion of a cytokine polypeptide that is required for 
binding to receptors for die cytokine on die surface of mammalian cells, or a mutetn of 
such a portion of a cytokine polypeptide. A B/T conqx>nent derived from a cytokine 

20 can, but need not, also contain the portion of die cytokine that is involved in "cytokine 
eflfector activi^/ as described below. 

Exanoples of cytokines diat can be used in the present invention include, tot 
example, otterieukins (such as IL-lo, ILrl/9, IL-2, IL-3, IL^, IL-S, IL-6, IL-7, IL-9 
(P40!), IL-10. IL-11, 11^12 and IL-13); CSF-lype cytokines subh as OM-CSF, GCSF, 

25 M-CSF, UF, EPO, TNF-o and JNF-P); intorferons (such as IFN^, IFN-^« IFN-r); 

cytokines of the TGF-iS family (such as TOPrffU TGP-ffl^ TOF-^, inhibui A, inhihhi 
B,. actlvin A, acdvin B); ciiemotactic factors (sudi as NAP-1, MCP-1, MlP^la, 
MlP'lfi. MIP-2, SlSfi^ SISa, SIS€t FP-4. FBP. yIP-10, MGSA); growdi factors (such 

. . . as EGP, TGF-d, aFGF, bF(3F, KGF, FDGF-A, PDGP-B, PD-ECXjF, INS, IGF-I, 

30 IGF-n, NGFwS); cHype inte r crin e cytoldoes (sudi as IL-8, GRO/MGSA, 

PBP/CTAP/jSTG, IP-IO, MIP-2. KC, 9B3); and ff-typt in t e r cr ine q^txddnes (sudi as 
MCAF. ACT-2/PAT 744/(326, LD-78/PAT 464, RANTES, G26, 1309, JE; TCA3, 
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MIP-lat/3« CRG-2). A number of other cytoldiies are also known to those of skill* in 
the art. The sources, characteristics, targets and effector activities of these cytokines 
have been described and, for many of the cytokines, the DNA sequences encoding the 
molecules are also known; see, e.g.. Van Snick, J. et al. (1989) J. Exp. Med. 169: 
5 363-368; Paul, S.R. et al. (1990) Proc. Nad. Acad. Sci. USA 87: 7512-7516; Gately, 
M.K. et al. (1991) J. ImmunoL 147: 874-882; Minty, A., et al. (1993) Natuie 362: 
248; and the reviews by Aral, K., et al. (1990) Anmi. Rev. Biochem. 59:783-836; and 
Oppenheim, J.J., et al. (1991) Annu. Rev. Immunol. 9:617-48; Waldman, T.A. (1989) 
Annu. Rev. Biochem. 58:875-911; Bcutler, B., et al. (1988) Annu. Rev. Biocfaem. 

10 57:505-18; Taniguchi, T. (1988) Annu. Rev. ImmunoL 6:439-64; Paul, W.E. et al., 
(1987) Annu. Rev. Immunol. 5:429-59; Pestka, S. et al., (1987) Annu. Rev. Biochem. 
56:727-77; Nicola, N.A. et al. (1989) Annu. Rev. Biochem. 58:45-77; and Schrader, 
J.W. (1986) Annu. Rev. Immunol. 4:205-30; and die particular publications reviewed 
and/or cited dierein, which are hereby incorporated by reference in their entity. 

15 Many of the DNA sequences encoding cytokines are also generally available from 
sequence databases such as GENBANK. Typically, cloned DNA encoding such 
cytokines will already be available as plasmids - although it is also possible to 
synthesize polynucleotides encoding the cytokines based i^>on the published sequelae 
information. Polynucleotides encoding the ^tokines can also be obtained using 

20 polymerase chain reaction (PCR) methodology, as described, for exairple, by Mullis 
and Faloona (1987) MedL Enzymolpgy 155:335. The detection, purification, and 
characterization of ^tokines, including assays for identifying new cytokines effective 
upon a given target cell type, have also been desoibed m a number of publications, 
inchidiQg, e.g., Clemens, M J. et al. (eds.) (1987) "Lymphcddnes and Interferons, 

25 IRL Press, Oxford; and DeMa^yer, E., et al. (1988) .*lnietferons and Odier Regulatory 
Cytokines," John Wiley & Sons, New Y<nk; as well as the references referred to 
above. 

The Ugaiids suitable for targetii^ a particular sub-population of cdls will be 
those ^iduch bind to receptors present on cells of that sub-population. Again, taking 
30 cytokines as an example, the target cells for a large number of Aese molecules are 
already known, as noted above; and, in many cases, the paTtiaitar cell surfeoe 
recqytors for the ^tokme have alrea^ been idemified and characteiized; see, e.g., die 
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publications referred to above. Typically, the ceU sur&ce receptors for cytokines are 
transmembrane glycoproteins that consist of either a single chain polypq>tide or 
multiple protein subunits. The receptors generally bind to their cognate ligands with 
high affinity and specificiQr, and may be widely distributed on a varied of somatic 
S cells, or quite specific to given cell subsets. The presence of cytokine recq>tors on a 
given cell type can also be predicted fitom the ability of a cytokine to modulate the 
growth or other characteristics of the given cell; and can be determined, for exanq)le, 
by monitorixig the bindiiQ of a labeled cytokine to such cells; and otiier techniques, as 
described in die references cited above. 

10 Thus, for example, a large mnnber of cytoldne receptors have been 

characterized and many of these are known to belQi« to recqytor families which share 
sunilar structural motifis; see, e.g., the review 1^ M^nna, A., et a!., Ann. Rev. 
Iihmunol. 10:295-331 (1992), and the publications reviewed therein, hei^ 
incorporated by reference. Type-I cytokine receptors (or hematopoietic growdi fitttor 

15 receptors) inchide, for example, the recqptors for IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, 
GM-CSF, O-CSF, EPO, CNTP and UF. l^pe-U cytokine receptors inchide, for 
example, the receptors for IFN-d, IFN-/3 and IFN-y. Type-m cytokine receptors 
include, for example, the recq>tor5 for TNF-o, TNF-/J, FAS, CD40 and NQF. Type- 
TV cytokme receptors (immunoglobulin-like, or "Ig-like,* recqptors) inchide die 

20 recq>tors for IL-1; and die receptors for IL-6 and G-CSF (which have Ig-like motift in 
addition to die l>pe-I motiO* These recqitor fiamilies are described for example, m 
Smifli et al.. Science 248: 1019-1023, 1990); Larsen et al., J. Exp. Med., 172: 1559- 
1570, 1990); McMahan et al., EMBO J. 10:2821-2832, 1991); and in the reviews by 
Cosman et a!.. Trends Biochem Sci 15: 265-269, 199(9; and Miyiyhna, A., et al., 

25 Arm. Rev. fmmunoL 10:295-331 (1992), and die publications reviewed therein, aU of 
wbiOi are hereby incor porated liy referenee. As new cytxddnes are characterized, diese 
can be daaplayed in die present invention as long as they exhibit die desired bindnqg 
characteristics and specificity. The identification and duoacterization of cytokuoes, and 
die use of assays to test die abili^ of qytpUnes to activate particular target cells, are 

30 known in die art; see, e.g., Clemens, M.J. et al. (eds.) (1987) *Lynq>hcridnes «iid 
Interfmns," IRL Press, QxfiEnd; and DeMaqrer^ B., et al. (1988) "Interferons aixl 
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Other Regulatoiy Cytokines/ John Wiley & Sons, New Yoik; as well as the references 
referred to above. 

The choice of a particular ligand will depend on the presence of cognate 
receptors on the desired target cells. It may also depend on the correspondii^ absence 
5 of cognate receptors on other cells which it may be preferable to avoid targetix^. Widi 
the cytokines, for exanq>le, the role of particular molecules in die regulation of various 
cellular systems is well known in the art. In the hematopoietic system, for exanq>le, 
the hematopoietic coloiq^-stimulating factors and interieukins regulate the production 
and function of mature blood-forming ceUs. Lymphocytes are dependent upon a 

10 number of cytokines for proliferation* For example, cytotoxic T lynqphocytes (CTLs) 
are dependent on helper T (Th) cell-derived cytokines, such as IL-2, for growth and 
proliferation in response to foreign antigens. (Zinkemagel and Doher^, Adv. 
Immunol. 27:S1, 1979; Male et al.. Advanced bnmunology. Chap. 7, Gower PubL, 
London, 1987; Jacobson et al., J. Immunol. 133:754, 1984). IL-2, for exanq)le» is a 

IS potent mitogen for CTLs (Gillis and Smith, Nature 268:134, 1977), and the 

combination of antigen and IL-2 cause proliferation of primary dM*** T cells in vitro. 
The inq>ortance of IL-2 for the growth and maintrnanrc of the CDS"^ CTL in vivo has 
been documented in models of adoptive immunodiecqyy in which the therapratic 
efficacy of transferred anti-retroviral €08*^ cells is enhanced on subsequnt 

20 admhiistration of ILr2 (Cheever et al., J. ^xp. Nfed. 155:968, 1982; Redddiase et al., 
J. ViroL 61:3102, 1987). IL*4 and IL-7 aie also csqiable of stimulating die 
proliferation of mature CD8'*' CTL (Aldmon et al., J. Exp. Med. 172:577, 199Q). In 
the case of IL-2, the IL-2 recqitors are eaqncssed on T-cells, B-cells, natural Idlln 
cells, glioma cells and ceUs of die mono^te lineage (finidi, Scieine 240:1169, 1988). 
. 25 However, the greatest level of higli afBni^ IL-2 nccpuxr oqmssion is observed hi 

activated T-ceUs (Waldenuum, Ann. Rev. Biocbem. 58:875, 1989). The IL-2 mxpbx 
conq>lex consists of duee protein conqKinents, a low affinity r eoq uor , a, Tac or p55 
(Lecmaid et al.. Nature 311:626, 1984), an intermediate affini^ recqnor, fi or p70 
(Hatakeyama, Science 244:551, 1989), and a signal transduction protein, y or p64, . 

30 which interacts widi the p70 receptor subunit CTakesbita et al.. Science 257:379, 1992). 
The conabination of die o and fi subunits togcAa make up a higb affini^ fimn of die 
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11^2 receptor (Hatakeyama, Science 244:551, 1989); a-ff-y combinatioiis appear to 
have the highest afTmity (Asa et al.. P.N.A.S. 90:4127-4131. 1993). 

Thus, for example, a GDFP including ILr2 can be used to target gene deliveiy 
specifically to activated T lymphocytes which express high levels of a-ff-y high affinity 
5 receptors. The cellular targets of a large number of the odier qrtokines are known and 
described in die revfews and other references cited above. Furthennoret followinig the 
approaches described in those references, any particular cell population or sub- 
population can be readily assayed for sensitivi^ to a given cytokine. 

The choice of a particular ligand may also be influenced by otfa» activities that 

10 may be possessed by the ligand (besides binding to the cell surfiice). For exanqile, 

GDFPs haviag B/T components derived from cytokines may possess ^tokme effector 
activity diat can be used to modulate the targeted cells in accordance with die activity 
of the cytokine, l^ically, ODFPs of this type will be prepared by incorporadog die 
entire cytokine coding sequence into a polynucleotide encoding the GDFP; although it 

15 will also be possible to remove portions of the cytokine sequence which are neidier 

required for binding to the receptor nor essential for cytokine effector activity. In sudi 
cases, the GDFPs can provide a combination of activities comprisiog: (i) binding to 
specific target cells; (ii) deliveiy of targeted nucleic acid into the targ^ed cells; and 
(iii) cytokine modulation of the ceUs thus targeted. Such a combination of activities 

20 will allow, for example^ the transduction of particular cells to be coupled to die 

proliferation of the transduced cells. This will be generally advantageous in the context 
of gene delivery since it can be used to promote the proliferation of die targeted cells 
in a given cell population; and will be particular advantageous for in vivo gene 
ddivery vAne it may be odierwise problematic or inqxissible to induce die targeted 
. 25 cells to divide, vibidt may be necessary for efiBdeiit. stable iixxxrpoiatlon of die 
transferred gene. 

In some cases, it wiU be preferable to make use of the recq^tor binding potential 
of a ligand such as a cytokme widiout concomitant effector activi^. This may be the 
case, for exanople, when a cytokine with suitable receptor bindirig pr opCT ti es bta a 
30 negative or unwaitfed eCfea on target cell activity. GDFPs of this ^pe can be 

prepared, for example, from cytokine sequ«ioes in ^wdiicfa the domain responsible far 
effector activity has been mutatiomdl^ altered by, e.g., substitution, insmion or 
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deledon. For example, IL-2 has been subjected to deletion analysis to identify which 
portions of the sequence are involved in receptor binding and which are critical for 
cytokine effector activity; see, e.g., Brandfauber, B.J. et al., J. Biol. Chem. 
262:12306-308, 1987; Brandhuber, B.J. et al.. Science 238:1707-09, 1987; Zurawski, 
5 S.M. et al., EMBO J. 7:1061-69, 1988; and Aiai, K., et al., Amu. Rev. Biodiem. 
59:783-836, 1990. The receptor binding and effector domains of a number of other 
cytokines have similarly been characterized; see Axai et al., id, and odier reviews and 
references cited therein. 

The rapidity with whidi novel ligands and their cognate receptors have rec«dy 

10 been molecularly cloned has generated a wide amy of diese molecules. In particular, 
the combination of direct cDNA e;q>ression clornqg and screeniiig assays for either 
induction of proliferation of bindipg to specific oeU sui&ce xecqptors on target cells has 
led to many new molecules being cloned (see, for example, Cosman et al.. Trends 
Biochem Sci IS: 265-269, 1990). The advent of these tedmolpgies will undoubtedly 

IS lead to the cloning of more ligands, including cytokines and other proteins tf«>t bind to 
cells, which, on the basis of their binding characteristics and specificity may be used in 
the context of die present invention as the B/T compomist of ttie GDFP. B/T 
components derived from the flk-2/flt-3 ligand (Lyman et al.. Cell 75:1157-1167, 
1993) will be of interest because the cytokine binds specifically to a recqytor, flk-2/flt- 

20 3, yf/tuch is expressed on early hematopoi^ cells (Manfaews, W. et al., CeU 65:1143, 
1991; and Small et al., P.N.A.S. 91:459-463, 1994). In the context of tbc present 
invention, GDFPs comprising a B/T component derived from the flk-2 ligand could 
thus be used to direct gene deliveiy to lynphohematopoietfe stem cells. 

While the foregoing principles have been illustrated using cytokines as a 

25 convenient example, these princq)les are also q>plicable to odier ligands c^iable of 
binding to ceU surfiaoes, indudii^ for example, antibodies, lectins, viral bindipg 
. proteins, cellular adhesion molecules, and any other proteins that assocute WiA cellular 
surfiftoes. 

For exanq>te,. a laige number of antibodies to ceU surftoe antigeiis have been 
30 identified and described. Antibodies to leukocytes have been well characterized and 
classified as the "CD" series of andgem; see, e.g., Coligan, J. et al. (ed.), "Cunent 
Protocols in Immunology," Current Protocols, 1992, 1994. Moreover, •ychniq^^ for 
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the isolation of new antibodies specific for a paiticular taigetceU are roiitiiie in the art. 
Useful antibodies will be those which interact with antigens on the surface of the 
desired target cells. Antibody/antigen binding can be readily determined and monitored 
by flow cytometry or other immunochemical detection methods. 
5 Of particular interest are antigens that arc exclusively or preferentialTy expressed 

on the surface of particular target cells. For example, the CD34 antigen is expressed 
on human lymphohematopoietic stem cells (Andrews et al.. Blood 80:1693-1701, 
1992). 

Transferrin, (see; e.g.. Zenke, M. et al., P.N.A.S. 87:3655-3659, 1990), can 
10 also be used as a B/T component in die context of die present invendoiL 

Targetiqg to certain celk, for example re^uratoty epithelial cells, can also take 
place via immunoglobulin dg) receptors (see, e.g., Ferkol, T., J. CUtl Invest. 
92:2394-2400 (1993). 

The GDFPs of die present invention can also be chemically modified, for 
15 exanqple by the addition of lactose to target the ODFP to asialoglycoprotein receptors 

and dms to hepatocytes of die liver (see, e.g., Neda, H. et al., J. Biochem. 266:14143- 
14146, 1991). 

Another group of proteins from which die B/T conqx>nents can be derived are 
lectins. A number of such molecules, and dieir cognate receptors, have been identified 
20 and characterized (see, e.g., the review by Lis & Sharon, Ann. Rev. Biochem. 55:35- 
67, 1986; and publications cited dierein). 

Proteins c^ble of targetmg the GDD and thus the GDFP/tNA complex to cell 
surfaces can also be derived from viruses. Many such viral proteins c^»ble of binding 
to cells have been identified, inchidii^, for exan^le, die well-known envelope ("env") 
•25 proteins of retroviruses; hemagglutinin proteins of RNA viruses such as die influenza 
virus; spike proteins of viruses such as the Semliki Forest vinis (Kielian and 
Jungerwirdi (1990) Mol. Biol. Med. 7:17-31); and proteins from non-envel(^ied viruses 
such ais aden&vuuses (see, e.g., Wickham et al.. Cell 73:309-319, 1993). 

As an illustrative example, m the murine leukemia vims (MaUV) system, it is 
30 well known dial the anurio-terrninal region of die gp76 molecule is involved m bindir^ 
to cell surfitce recqitors, see, e.g.. Heard and DarK», J. Viroi. 65: 4026-4032, 1991. 
Baltini et al., J. ^^1. 66: 1468-1475 (1992) have also reported dial portions of the 
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amino-tenninal region of gp70 can be exchanged in order to switch bindii^ to different 
MuLV env receptors without interfering with the ability of the protein to interact with 
pl5E TM protein (and» thereby, to mediate viral uptake); see also Weiss, R. et al. in 
Weiss, R. et al. (eds.), RNA Tumor Viruses, Cold Spring Haxbor, New York (1984 
S and 1985). Similarly, in the human immunode&iency virus (HIV) system, mutational 
analysis of gpl20 has identified portions of Ae molecule ixiiidi are critical for bindipg 
to the CD4 receptor, see, e.g., Kowalski, M. et al.. Science 237:1331-1355, 1987. 
Yet another approach to identify a region critical for receptor binding is as follows: an 
antibody known to inhibit binding can be used to immuno-afBnily purity a cleavage 

10 fragment of the vural bindiiig protein; which fiagmmt is dien partially sequenced to 

identify the correspondii« domain of the viral binding protein, see, e.g., Laskey, L.A. 
et al., Cdl 50:975-985. 1987. Such techniques can be employed in the present 
invention to generate GDFPs in which the M-D conq>onent remains capable of 
mediating iqptake of the GDFP/tNA complex (as described below), but the specificily of 

15 binding is principally determined by the presence of, e.g., cognate ^toldne receptors 
corresponding to a portion of the B/T conqxment, rather than viral bindiQg protein 
receptors. 

Another illustrative example of a viral protein that can be used is ttie G protein 
of VSV, which has been utilized to target infection by retroviral vectors; see, e.g., Emi 
20 et al., J. Virol., 65:1202-1207, 1991. 

Another group of proteins from v/indi die B/T components can be derived are 
cellular adhesion molecules. A number of such molecules, and their cognate receptors, 
have been identified and characterized (see, e.g.. Springs, T., Nature 346:425-434« 
1990, and pubUcadons cited dierenO- 

25 

(2\ Membrane-tHsmnting rM-D^ Components 

Menibrane-disnq>ting components are protem sequences c^Mdble df locally 
disrupting cellular membranes such tiiat die C9>FP/tNA c(»nplex can traverse a celhdar 
membrane* 

30 . M-D GotnpoDtias facilitating vpiakc of the GDFP-targeied nucleic acid conqsiex 

by target cells are typically membrane-active regions of protein structure having a 
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hydrophobic character. Such regions are typical in membrane-active proteins involved 
in facilitatuig cellular entry of proteins or particles. 

For example, viruses commonly enter cells by endocytosis and have evolved 
mechanisms for disrupting endosomal membranes. Many enveloped viruses encode 
5 surface proteins capable of disrupting cellular membranes including, for example, 

retroviruses, influenza virus, Sindbis virus, SemlQd Forest virus. Vesicular Suimatitis 
Virus, Sendai virus. Vaccinia virus, and mouse hepatitis virus; see e.g., Kielian and 
Jungerwirdi, Mol. Biol. Med*, 7:17-31, 1990; and Marsh & Helenius, Adv. Virus 
Res., 36:107-151, 1989. The mechanism for viral entry, in ^uliich a viral bindiqg 

10 protein binds to a specific cell surftce receptor and subsequently mediates virus entry, 
frequently by means of a hydrophobic membrane-disruptive domain, is a common 
theme among enveloped viruses, including influenza vims, and many such molecules 
are known to diose skilled in die art, see, e.g.. Hunter and Swanstrom, Curr. Top. 
Micro, and Immunol. 157:187, 1990; and the review by White, J., Science 258:917- 

15 924, 1992; and publications reviewed dierein. 

By way of iUustration, the M-D components of the present invention can thus be 
derived from a portion of a viral binding protein fliat is normally involved in mediating 
uptake of the virus into a host cell, or a mutein of such a portion of a binding protein. 
The portion of the GDFP that may be derived fincmi such a viral binding protein may, 

20 but need not, also contain the portion of the binding protein that causes die viral 

particle to associate with a specific receptor on a target cell (which latter portion mi^ 
thus function as a B/T conq>onent, as described above). A large number of viruses 
have been characterized and, for many of these, the nucleotide sequence of die viral 
genome has been published. The binding proteins encoded lyy various viruses generally 

25. share functional homology, even though there may be considerable variation among the 
primary amino acid sequences. Using the retroviruses to illustrate, the nairive env gene 
product is typically a polyprotein precursor that is proteolytically cleaved during 
transport to the cell sur&ce to yield two polypeptides: a glycosylated polypq>tide on the 
external sur&oe (the "SU* protein) and a menibrane-^anning or traasmemhrane 

30 . protein (flu; ""TM" protein); see, e.g.. Hunter, E. and R. Swanstram, Curr. Topics 

Microbiol. Immunol. 157:187*253, 1990. The SU proteins are responsible fiir binding 
to specific receptors on die surfiBoe of target cdls as a first step in die infiBCtion 
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process. The TM proteins, as well as associating with viral core proteins through their 
C-terminal ends, are responsible for a critical membrane fusion event whidi takes place 
after binding and allows entry of the virus into the cell (See, e.g.. Hunter and 
Swanstrom (1990) Curt. Top. Micro, and Immunol. 157:187; Kielian and Jungerwirth 
5 (1990) Mol. Biol. Med. 7:17-31; and Marsh & Helenhis (1989) Adv. Virus Res., 
36:107-151). The membrane fusion event is accomplished by a lydrophobic 
polypeptide sequence present at the amino terminus of the TM protein. Exanq)les of 
these pairs of SU and TM proteins and the viruses which produce them are: gpS2 and 
gp36 from mouse mammary tumor virus (Racevslds, J. et al., J. Virol. 35:937-48, 

10 * 1980); gp85 and gp37 from Rous sarcoma virus (Hunter et a!., J. Virol. 46:920, 1983); 
gp70 and pl5E from Moloney murine leukemia virus (Koch et al., 49:828* 1984); gp70 
and gp20 from Mason Pfizer monkey vuus (Bradac, J. et al., Virolpgy 150:491-502, 
198Q; gpl20 and gp41 from human immunodeflciengr vims (KowalsU, M. et al.. 
Science 237:1351-1355, 1987); and gp46 and ggUl fiom human T-CeU leukemia virus 

15 (Seiki et al., Proc. Nad. Acad. Sci. 80:3618, 1983); and ottim described m Ae 

references cited herein. The functional similari^ among diese types of proteins* is 
fiirdier illustrated by the well-documented phenomenon of "pseudotypir^," in which the 
core proteins and nucleic acid are provided by a first virus and tte envelope proteins 
(determining host range) are provided by a different virus (see, e.g.. Vile et al., 

20 Virology 180:420, 1991; MiUer et al, J. Virol. 65:2220, 1991; and Landau et al., J. 

Virol. 65:162, 1991). Examples of retroviruses yfhich can be used to derive fragments 
for use in the present invention include murine retroviruses such as Harvey murine 
sarcoma virus (HaTMSV), Kirsten murine sarcoma virus (Ki-MSV), Moloney murine 
sarcoma virus (Mo-MSV), various murine leukemia viruses (MuLV), mouse mammary 

25 tumor virus (MMTV), murine sarcoma virus (MSV) and rat sarcoma virus (RaSV); 

bovine leukemia virus (BLV); feline retroviruses such as feline leukemia virus (FeLV) 
and feline sarcoma virus (FeSV); primate r^roviruses such as baboon endogenous virus 
(BaEV), human unmunodeficien^ vhuses (HIV-I and HIV-II), human T-cell leukemia 
viruses (HTLVrl and HTLV-II),. Gibbon ape leukemia virus. Mason Pfizer monkey 

30' virus (M-PMV), simian immunodeficien^ virus (SIV) and simian sarcoma vtaus 

(SSV); various lentlviruses; and avian retroviruses sudi as avian eiy llkroblastosis virus, 
avian leukosis virus (ALV), avian myeloblastoris vims, avian sarcoma vims <ASV), 
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avian reticuloendotheliosis-associated virus (REV-A), Fujinami sarcoma virus (FuSV), 
spleen necrosis virus (SNV) and Rous sarcoma virus (RSV). Many other suitable 
.reat)viruses are Icnown to those skilled in die art and a taxonomy of retroviruses is 
provided by Teich, pp. 1-16 in Weiss et al.» eds, RNA Tumor Viruses , 2d ed., VoL2, 
5 Cold Spring Harbor, New York. Plasmids containiiig retroviral genomes are also 
widely available from the ATCX; and other sources. 

Infectious vurions have also been produced when ngs-retroviral binding proteins, 
such as the G protein of vesicular stomatitis virus or the hemagglutinin of influenza 
virus, have been pseudo-typed onto retrovirus cores (see, e.g., Emi et al., J. Virol. 

10 65:1207, 1991; and Dong et al., J. Virol. 66:7374, 1992). These latter examples 

indicate that there are functional commonalities b e t w een various viruses and their mode 
of entry into cells which will allow die use of viral bixdinig proteins from a varieQf of 
sources. Influenza hemagglutinin has also been reported to enhance the uptake of poly- 
L^lysine-based chemical conjugates (Wagner et al., P.N.A.S. 89:7934-7938, 1992). 

IS The sequences of a large number of viral bindii^ proteins are known, and are 

gennally available from sequence databases such as GENBANK. nirtfaermore, 
po^mudeotides encoding viral bindiqg inoteins can be readily obtained from viral 
particles diemselves. Also, since many different genes encoding viral binding proteins 
have been cloned and diaracterized, plasmids containing DNA encoding the bindimg 

20 proteins are available from a number of different sources. Polynucleotides encoding 
viral binding proteins can also be obtained using polymerase chain reaction (PGR) 
mediodolc>gy, as described, fcxr exanqile, MuUis and Faloona (1987) Medi. 
Enzymology 155:335. 

As an illustrative embodiment of die present invention, a GDFP may conq>rise a 

25 region' of a gene encoding a viral binding protein including a B/T conqKment, in which 
. case die GDFP can be used to target cells including those normally susceptible to the 
virus from which the gene was derived. In odier embodiments of the present 
invention, the targeting may be restricted to cells bearing r ecept ors far other types of 
ligands, discussed above under the descrqition of die B/T conqxment. For exanq[>le, 

30 where an M-D conqKmnxt is derived from a viral bindmg protein that retahis die abiliQr 
to bind to the viral recqitor, but it is desirable to limit targeting to cdb bearing. e.g., 
an appropriate cytokine recqnor, dne are several approaches that can be used to 
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achieve such specificity. One approach is to utilize a B/T component which is based 
on a cytokine with a veiy high binding affinity for the desired target cells compared to 
the binding affinity of a domain derived from a viral bindipg protein for the native viral 
binding protein receptors. Since many of die cytokines are known to exhibit very high 
5 affinity binding to their receptors, and since it will be feasible* for example, to base the 
M-D component on a lower-binding-afifinity viral binding protein, targeting can be 
effectively focused upon those cells bearing a cognate cytokine zcceptor. Anodier 
suitable approach to limitii^ binding is to derive the M-D ccmiponent from a mutant 
viral binding protein in which the mutation has disnqrted the ability of the protein to 

10 engage in binding via the native viral binding protein receptor but has not interfiered 

with the ability of the viral bindiqg protein to mediate viral uptake. Plasnuds encoding 
such mutant viral bmding proteins are available in the art; and it will also be wdl 
within die ability of one skilled in the art to prepare new versions of such viral binding 
protein mutants by deleting portions of die coding region 6r by introducing amino acid 

IS substitutions into die coding sequence as described above. 

While the foregoing principles have been illustrated using viral proteins as a 
convenient example, these princ^les are also applicable to odxer polyp^tides capable 
of disrvqpting cellular membranes (see, e.g., the review by Whilet J., Science 258:917- 
924, 1992, and publications reviewed therein). 

20 Other domains that ate functionally and/or structurally analogous can be derived 

from various viral, prokaryotic or eukaryotic sources. As a furdier specific exanq>le, 
bacterial toxins such as diphtheria toxin have a specific domain with a highly 
alpha-helical structure and a hydrophobic character (known as the *TM" domain in the 
case of diphtheria toxin) that becomes protonated at low pH and disnqits cellular 

25 mqnbranes, facilitating entry of the toxin into the cells (see, e.g.. Choc et al. (1992) 
Nature 357:216-222; vanderSpeck et al, J. Biol Chan 268: 12077-12082, 1993; and 
Parker & Pattus, Trends Biochem. Sci. 18:391-395, 1993). Toxins such as 
Pseudomonas exotoxin A have a similar membrane-disnqyting domain (see, e.g., Strom 
et al.. Am. N.Y. Acad. Sd. 636:233-250, 1991). Similar M-D components can be 

30 derived from other bacterial toxins such as hemolysin (Suttorp et al., J. Exp. Med., 
178:337-341, 1993). As described herein, inclusion of such membrane disruptive 
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components in the GDFP would facilitate membrane disnq)tion and entiy of the 
GDFP-tNA conq)lex into die target cells. 

Cytolytic pore-forming proteins, such as streptolysin O, perforins expxessicd by 
cytotoxic T lyn^hocytes, and S.aureus alpha toxin also have the abiliQr to disrupt 
S membranes (see, e.g., Ojcius and Youi«. Trends Biochem. ScL, 16:225-230, 1991; 
Suttorp et al., J. Exp. Med., 178:337-341, 1993). Strepto^in O has been shown to 
facilitate uptake of DNA by cultured cells when added to the culture medium (Bany et 
al., Biotechniques 15:1018-1020). There are maiiy bacterial cytolysins which have die 
capability to induce membrane disnqption (see, e.g., Braun and Focareta., Cxil. Rev. 

10 * Microbiol. 18:115-158, 1991; and van der Goot et al.. Nature 354:408-411^ 1991). 
Membrane disnQ>tion often occurs by means of a pH-induced hydniphobic change in 
the protein, but this can also occur by en^matic means, such as diose involvirig 
phospholqpases (see, e.g» Braun and FocavMu, CriL Rev. Microbiol. 18:115-158, 1991 
(and references cited dierein); and London, Mol. Microbiol. 6:3277-3282, 1992). 

15 Where a pH shift is required to induce the membrane disruption function, tfaoe are 
several ways in wUch diis can be achieved. For example, the GDFP/tNA complex 
may be taken up dirougli acid endosomes; or the pH of the extracellular medium may 
be transiently lowered to mediate activation of die membrane disruption functicm. In 
some cases (dq>htheria toxin for exanqile), oizymatic niddiv of die membrane acdve 

20 component prior to an induced pH change in die surroundiiig medium is believed to 

promote membrane disruption (see, e.g., Sandvig and OIsims, J. Cell Biol. 87:828-832, 
1980; Moslcaug et al., J: Biol. Chem. 263:2518-2525, 1988; and Zatanan and 
Wisnieski, Proc. Natl. Acad. Sci. 81:3341-3345, 1984). WeD-known en^mes such as 
trypsin and urokinase have been successfully used to provide the nieifitig activity in 

25 vitro (see, e.g., ^lliams, D.P., et al., J. Biol. Chem. 265:20673-20677, 1990). 
Enzymes capable of providing die nicking activi^ are also known to be found on 
cellular surfiEu:es (see, e.g., Willianos, D.P., et al., kl.). Exenqplary construction and 
characterization of GDFPs containing the dipdieria toxin transmembrane r^on are 
described below in Example 8. . . . 

30 Other sources of M-D components inchide bacterial proteins diat promote «try 

of organisms into cells, suth as die 52kD entry protein of Mycobaderium mberadasis 
(Arruda et al.. Science 261:1454-1457, 1993); die intemalin protein otUsieria 
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monocytogenes (Portnoy et al.. Inf. Imm. (U.S.) 60:1263-1267); and the invasin 
protein of Yersinia enterocoUtica (Yoixqg et al., J. Cell Bio., 116, 197-207), among 
others. 

Synthetic analogs of membrane-disniptii^ domains can also be made. See, e.g., 
5 Kaiser and Kezdy, Science 223:249-253, 1984. 



(Ti Transport/Localization fT/L) Components 

Transport/localization components mediate or augmem die transport dod/m 
localization of the GDFP/tNA complex to a particular sub-celhilar con^artment such as 

10 the nucleus or mitochondrion. 

A number of sequences that mediate transport and/or localizaticm of proteins 
have been identified. These mchide, by way of iUustradon, die nuclear localization 
sequence (nls) of SV40 T antigen (CoUedge* et al., Mol. Cell Bio. 6:413^4139, 1986); 
and the HIV matrix protem (Bukrinsky et al.« Nature 365:666-669, 1993)* Tliese ate 

15 typicaDy short basic peptide sequences, and may also be bqiartite basic sequences (see, 
e.g., Garcia-Bustos et al., Biochim. Biopfays. Acta 1071:83-101, 1991; and Robbins et 
al.. Cell 64:615-623, 1991). Nuclear localization sequences have been fused to 
heterologous proteins and sho^ to confer on them the property of nuclear localization 
(see, e.g., Biocca et al., EMBO J. 9:1Q1-108, In the case of the human 

20 estrogen receptors, for example, fusion proteins traffic to the nucleus in an estrogen- 
dependent fashion (Ishibashi et al., J. Biol. Chem. 269:7645-7650, 1994). These 
sequences can be readily incorporated into the GDD by recombinant DN A methodology 
to facilitate nuclear localization of the desired GDFP/tNA complex. GAL4 has also 
been shown to posses nuclear localization properties in yeast (see, e.g., Sflver, et al., 

25 P.N.A.S. 81:5951-5955, 1984), and dnis, as a component of a GDFP, GAL4 may be 
used as both an NBD and a GDD widi a role in transport/localization. 

(A\ Replicon Integration am Components 

Replicon integration conqx)nents mediate or augment integration of the taigeted 
30. nucleic acid into a replicon of the target cell, such as a chromosome. In many 
insranms in gene transfer and gene dierapy it is advantageous to obtain stable 
integration of transferred DNA uno the genome of die target odl. The GDFP.cam 
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facilitate such integration. Also, as described herein, a particular advantage of the 
Type I GDFP approach is that it not only allows the stoichiometric attachment of 
deliveiy components to the tNA, but also allows the GDFP to be positioned at pie- 
determined locations with respect to the tNA. In the case of a rq>licon integration 
5 component such as an integrase, the GDFP can be positioned in proximity to terminal 
integrase recognition sequences as a means of facilitatii^ integration, as described in 
more detail below. 

DNA-protem interactions can mediate integration of DNA into the mammalian 
genome. For example, the mtegration of all known letioviiuses takes place in an 

10 enqonatic reaction that makes an endonucleolytic cleavage of die host DNA and ligates 
the reverse-transcribed retroviral genome to the free ends of the host cell DNA. This 
reaction is mediated by the retroviral integrase (or "IN*) protein, and it is weU known 
that the IN protein interacts with a minimal number of bases present on the ends of the 
pre-integradve viral genome to achieve integration. Indeed, DNA sequences beaiinig 

15 the IN sequence recognition motif can be inserted into ftee DNA in vitio 1^ purified 
IN proteins (see, e.g., Bushman et al.. Science 249:1555*1558, 19SK>; and Katz et al.. 
Cell 63:87-95, 1990; see also. Brown et al. CeU 49:347-356, 1987; and Roth et al.. 
Cell 58:47-54, 1989). For example, the MLV, HIV and RSV IN protems are eadi 
known to interact widi a distinct short IN sequence recognition motif present at each 

20 end of the linear pre-integrative viral DNA substrate to mediate its integration into the 
host cell replicon. In vitro integration mediated by purified IN protein has been 
demonstrated using eidier free oligonucleotides or synthetic DNA substrates beariog the 
IN recognition sequence motif (see, e.g., Katz et al., supra; and Bushman and Craigie, 
J. Virol. 64: 5648, 1990). Syntiietic DNA substrates can be readily engineered by 

25. inserting a unique restriction enzyme site (^ically Miel), flanked by the appropriate 
IN recognition sequences, into a plasmid vector. Digestion of die vector widi Ndd 
yields a DNA substrate widi 3* recessed ends preceded by the highly conserved 
5*CA-OH dhnicleotide and the remainder of the appropriate IN recognition motif, 
which resenibles the proc^sed ends of jdbe pre-integrative viral DNA wilh witich IN 

30 interacts to mediate mtegration. This iqpproach has been successfully used to 

demonstrate in vitro integration of such linearizied DNA substrates into double-stranded 
DNA in vitro by purified recombinant avian retrovirus IN (Katz et al., supra). 
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MoMLV IN (Bushman and Craigie, J. VizoL, supra) and HIV IN (Bushman et al.. 
Science, supra). The IN recognition sequences used on the tennini of die substrate 
DNA in these experiments were short (10-30 base pairs), demonstrating that 
heterologous DNA substrates with short tenninal IN sequence recognition motifs can be 
S integrated into double-stranded DNA by IN. Moreover, ttiese e^q^eriments document 
successful integration of both ends of the DNA substrate into the target DNA, as 
opposed to the oligonucleotide integration experiments which assay only for integration 
of a sii^e end of the substrate DNA* These experiments document that linear DNA 
substrates bearing short tenninal IN recognition motifs can be integrated into 

10 double-stranded DNA in vitro by purified IN protem. Thus, the foregoipg expeiime n ts 
provide further evidence of the utility of the present invention, in Aat IN components 
can be included in the GDFP and can act in concert with terminal IN recognition 
sequence motifs present on the (substrate) fNA to mediate ^ficient integration. 
Recombinant fusions between integrase and heterologous proteins have previously been 

15 constructed, expressed and shown to retain integrase en^matic activity (see, e.g., Vink 
et al.» J. Virol., 68:1468-1474, 1994). Moreover, Bushman (PNAS 91:9233, 1994) 
has shown that recombinant fusions can be made between integrase and a sequence- 
specific DNA binding protem, and that such fusions retam integrase activiQr and 
sequence-specific DNA bindipg. 

20 Thus, for exanple, by including an RI conqponent derived firom an integrase 

protein in the GDD, and using a tNA bearing the IN recognition sites, die GDFP can 
be co-introduced with tihe targeted nucleic acid (fSA) bearing die integration 
recognition motif and thus achieve integration of die tNA into a replicon of the target 
cell. This system would also allow, in conjunction an appropriate binding domain 

25 in the NBD, for the association of die RI component with die free end$ of the tNA. 

This would be advantageous since the IN proteins of r^roviiuses function to mediate 
integration at die free ends of pre-integrative viral DNA. In the present invention, this 
can be achieved by utiiiring a Type-I GDFP in coiyunction with a lineariiifld INA 
containing the cognate recognition sequence ("CRS") for die NBD at (or in close 

30. proximity to) die ends of die tNA beariiig the tenninal IN sequence recognition motif 
(preferably less dian SOO nucleotides firom the IN sequence^ more preferably less dian 
200 nucleotides, most preferably less dian SO nucleotides). To generate die tNA tor 



wo 95/28494 



PCT/US9Sm738 



-35- 

gene delivery, the tNA can be constnicted, for exaiiq)le, with a unique Ndel site 
between the two IN recognition motifs, flanked by the cognate recognition sequence of 
the NBD. Digestion with Ndel would then generate a linear DNA molecule widi 3* 
recessed ends preceded by the S*CA-OH dinucleotide, the remainder of the IN 
S sequence recognition motif, and the CRS for fhe NBD, m that order. In this way, the 
terminal IN recognition sequences would be closely linked to the cognate recognition 
sequence for the NBD. Typically, the Ndel site/IN recognition sequence/CRS cassette 
would be inserted into the plasmid backbone of the vector. However, it is possible to 
construa a tNA devoid of any extraneous or undesirable sequences; for exansple, a 

10 tNA devokl of any bacterial plasmid sequence can be generated by flanking each end of 
the mjiynmaiifln expression cassette in the tNA widi a CRS, followed by ooc half of the 
IN recognition sequence, followed 1^ an Ndel site. Digestion by Ndel would then 
generate a linear tNA DNA fragment, whidi could be readity purified from the plasmid 
backbone fragment, having on each end die IN recognition sequence and the CRS. 

15 Removal of plasmid backbone sequences may be desirable to achieve optimal gene 
regT2lation in fhe transduced cells. Binding of the GDFP would then locate the RI 
component containiiig the IN region in close proximi^ to the sites on the tNA witii 
which it can mediate efficient integratk>iL An analogous strategy can be used witii die 
AAV Rep protein and viral ITRs (see. e.g., Owens al., J. Virol. 67:997-1005 

20 (1993), and die review by Carter, B.J., Current Opmion Biotedi. 3:533*539 (1992) and 
publications reviewed therein). Additionally, odier recombinase systems such as die 
bacteriophage PI ere recombinase, the yeast FLP recombinase, die yeast SRl-derived 
R recombinase and die Tyl integrase (see, e.g., Kilby €L al.. Trend Genet., 9:413-421, 
1993; Moore and Garfinkel. PNAS 91:1843-1847, 1994) can be used m die context of 

25 the present invention using fusions with appropriate NBDs, cis-acting recombinase 
recognition sequences aiKl CRS elements, in an analogous fieidiion to that described 
above for die integrase fkisknis. 

Muhhnerizing die RI domain may be required fi^ opttmiring integration 
activity, especially ui siniations ui whidi die protein fixim 

30 derived flmctions in ihultimeric form. Thus, for example, many native r^roviral IN 
proteins are dimeric or nuiltimeric in form (see, e*g., Jones et id., J. Biol Chem 23: 
16037, 1992; reviewed in Skalka, Gene 135:175, 1993). Multimerization of dK IN 
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domain can be conveniently achieved by, for example: (a) constructing tandem repeats 
of the IN component in the GDFP, preferably separated by a flexon; (b) dimerizing the 
GDFP by insertion of a protein dimerizadon motif, e.g., a leucine zipper motif (see, 
for example, Hu et al., 1990, Science 250: 1400); (c) adding free IN protein to an IN- 
5 containing GDFP (since IN proteins have a natural tendency to multimerize); or (d) 

multimerizing the CRS in the tNA such that multiple GDFP molecules bind to each end 
of the tNA. Combinations of the above strategies can also be used. This would result 
in further multimerization of the IN component and thus a more active integration 
complex. 

10 Another strategy to achieve multimerization of the RI domains, and also to 

achieve more efficient conceited integration of the tNA (i.e. haviqg both ends of die 
tNA integrate into the replicon), would be to esigineer the sysnan to bring the free ends 
of the linear tNA together. This can be achieved in a nunaber of ways in the context of 
the present invention. First, the GDFP monomers can be designed in such a way that 

IS they self-dimerize, using a leucine :^n>er or odier modf as described above. Thus, 

dimerization of the GDFP bound to the tNA would result in close opposition of the fiee 
ends of the tNA, since the CRS is located near fliese termini. A second apptoaeh 
would be to use a second, separate DN A binding protein wiA additional cognate 
recognition sequences present near the temoini of the tNA to bring the ends of the tNA 

20 together (for this purpose any of the DNA binding domains alluded to above could be 
used in dimeric form togedier with a tNA having the appropriate cognate recognition 
sequence to associate the free DNA ends). Such strategies would bring Ae free ends of 
the tNA in close supposition to one anotho* and tinis may further enhance the frequency 
of concerted integration. Several approaches can be used to avoid any potential 

25 problem that may arise from GDFP/tNA con^lex auto-integration (i.e. integration of 
the ends of the tNA molecule into itself), or cross-integration of one tNA moleoile into 
anotiier. Although complexmg of the GDFP wiOi tbc tNA could be done at thus 
reducing IN enzymatic activity, placiiig the complex onto cells at higher tenxperatuies 
could lead to such unwanted integration events. A preferred upptosach is to use a 

30 . conditicmal RI moiety. Such conditional RI moieties can be dependent on chemical or 
protein co-factors, or can be mutants diat are conditional for fliU activity dependent on 
tenq>erature or other variables, such as the pres e tioe or absence of inhibitors or co- 
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factors. For example, in the case of IN, tempeFature-sensitive (ts) IN mutants have 
been made that are active only at certain temperatures (see, for example, Schwartzbeig 
et al.. Virology 192:673, 1993). The use of a r5 RI component would allow eiqposuie 
to and uptake of the complex by cells to be done at the non-permissive temperature 
5 (such that the RI coxaponsta would not be active), followed by switching to the 

pennissive temperature once the complex was taken up into the nucleus, allowing the 
RI compoMBSt to be active in the context of the host cell replicon and thus accomplish 
die desired integration. 

Thus, mchision of an RI QonnponssBt in die GDFP can be used to enhance 

10 frequencies of integration. The GDD can consist of an RI conqK>nent alone, or it can 
in addition comprise one or more of the otiier components discussed above. Where die 
RI component is die sole coniponent in the GDD, the NBD would function to associate 
die RI component more stably and/or more specifically with the free ends of the tNA 
than is possible tluxm^ for example, use of die recombinant native IN protein alone. 

IS By vntue of the NBD bindfa«, die RI moiety is more ti^itly associated witii die tNA 
termini during die transfection process and can mediate integration into the host cell 
replicon. Where die RI conqKment is the sole component in die GDD, tlie d^A/GDFP 
complex can be delivered by any of the standard means of transaction, such as 
lipofection, electroporation, etc., and the resulting cells would have an mhanmrt 

20 frequency of stable gene delivery as a consequence of enhanced integration of the tNA. 
Alternatively, die complex can be delivered by other non-viral means, including for 
example die use of self-assembling systems such as viral cqisid proteins. Experimental 
evidence has confirmed that viral capsid proteins can be used to introduce DNA into 
mammalian cells (see, e.g., Forstova et al., Hum. Gene Ther., 6:297-2k)6, 1995). 

25 In certain cases, such as die retroviral IN or AAV proteins^ components of 

the GDD can also function as effective NBD conqxments and thus fidflll a dual 
fuiDction in the GDFP by virtue of their ability to bind nucleic acid (see, e.g., 
Krongstad & Chanqxmx, J. VnoL 64:2796*2801, 1990; Owens et al., J. Virol. 67:997- 
1005 (1993); and the review by Carter, BJ., Current Opinion Biotecta* 3:533-539 . 

30 (1992) and publications reviewed therein). 
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2. The Targeted nucleic acid ftNA^ 

The targeted nucteic acid (tNA) is a polynucleotide, or analog thereof, to be 
delivered to a target cell. Thus, targeted nucleic acids inchute, for example, 
oligonucleotides and longer polymm of DNA, RNA or analogs thereof, in double- 
5 stranded or single-stranded form. The tNA may be either circular, supercoiled or 
linear. A preferred exaittplt of a targeted nucleic acid is a DNA esqyression vector 
conq>rising a gene (or genes) of interest operably linked to a transcriptional control 
region (or regions) and a cognate recognition sequence capable of beii% bound by die 
NBD domain of the GDFP. The transcriptional control region may be selected so as to 

10 be specifically activated in die desired target cells, or to be responsive to specific 
cellular or other stimuli. 

Targeted nucleic acids may also include, for example, positive aiKl/or negative 
selectable maricers; diereby allowing the selection for and/or against cells staUy 
expressing die selectable marioer, eidier in vitro or in vivo. 

IS Use of die present invention to deliver RNA would enable tihe introduction of 

RNA deo^s (Sullei^ et al.. Cell 63:601-608, 1990); ribozymes (YouQg ct al.. Cell 
67:1007-1019, 1991); and antisense nucleic adds (\ndQet8 ec al.. Nucleic Acid Res., 
19:3359-3368), for example. 

In T^^I GDFPs, the targeted nudeic acids are recognized and bouiKi by the 

20 GDFP by virtue of specific cognate recognition sequences to vftitdti die nucleic add 
bindmg domain (NBD) of the Type-I GDFP binds. Both DNA and RNA bindir^g 
domains have been isolated from proteins that bind to particular nucleic acids in a 
sequeixre-specific fashion. Inclusion of such a cognate recognition sequence in the 
targeted nucleic acid allows for specific bindiiig of the GDFP to the tNA. Recognition 

25 sites for many nucleic acid bindiqg proteins have been identified (see, for example, 
Mitchell. & l^ian, 1989; and other refermces herein). 

Binding of sequence-specific binding proteins to DNA tends to be imire avid 
when die recognition sequeiKe motif is multimerized (see, e.g., Hochschild and 
Ptashne, Cell 44:681-687, 1986). Accordingly, die cognate recognition sequnoesmay 

30 be multimerized hi the targeted nucleic acids so as to enhance the binding afiBniiy or 
selectivity of a GDFP for its cognate tNA. Tliis could also have other advantages, 
such as uxireasu« die effective amount of die GDFP bound to die tNA, or promotiog 
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compaction/condensation of the tNA by sequence-specific or sequence-non-specific 
NBD components. 

Typically, but not necessarily, the cognate recognition sequences in expression 
vectors will be placed in the plasmid backbone of die vector. This also applies to other 

5 cis-actii% sequences that are needed in the tNA to facilitate gene delivery. However, it 
may be desirable to remove plasmid bacldx)ne sequences fixim the DNA to be 
transferred. In this case, the expression cassette can be conveniently flanked 
restriction en^me sites, such that restriction en^rme digestion scpzrsass the backbone 
from the mammalian agression cassette. The eaqiression cassette can then be purified 

10 away from ibt plasmid backbone for use in transduction experiments. Qearly in dds 
case die CRS wouU need to be located on die fragment beariqg die ejqxiession cassette. 
It is also possible, of course, to construct die GDFP so as to bind to more than one 
tNA. 

As discussed above, the tNA can also be bound to the GDFP via sequenoe-non- 

IS specific interacdons in addition to sequence-specific interactions. In a lype-I GDFP, 
such sequence-non-specific interactions can be mediated by auxiliary c omponen ts 
derived from sequenoe-non-spedfic bindiiig proteins, as discussed above. Such 
auxiliaiy non-specific binding conqxments can also serve to compact or otherwise 
reconfigure the targ^ed nucleic acid; see, supra. 

20 The taigeted nucleic acids can also include, for exanqile, non-expressed DNA, 

such as sequences homologous to sequences present in a target cell rqilicon, that can 
thereby mediate homologous recombination. This can be used to facilitate the stable 
integration of the targeted nucleic acid, or a desired portion thereof, into a specific site 
in a rqilicon present in die target cell, such as a specific site in a cellular chromosome. 

25 This may be useful, for exan^le, to achieve a desired level of e3q>ression of the tNA 
by integration at a desired chromosomal site. Homologous recombination can also be 
used to aher a specific DNA sequence in a tazg^ cell replicon (see, e.g., Thomas & 
Capecchi, Cell 51:503-512, 1987). 

For longer tNA sequences, or where the tNA tqMake mechanism (whedier part 

30 of the GDFP or not) is known or suspected to be sensitive to the size, form or charge 
of nucleic acids and/or complexes to be delivered, such as mechanisms involving 
endocytosis, it may be desirable to condense and/or charge neutralize tbe tNA. This 
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can be achieved by mixing the tNA with any of a number of proteins or other agents 
(collectively referred to as "compacting agents") that can condense and/or charge 
neutralize nucleic acids. Compacting agents include, for example, histones (see, e.g., 
von Holt, Bioassays 3:120-124, 1986; and Rhodes, Nucleic Acid Res., 6:1803-1816, 
S 1979); or polypeptides derived thereftom (Rodriguez et al., Biopl^. Chem., 39:143- 
132, 1991); as well as the non-histone high mobili^ group proteins. Poly-L-lysme or 
other polybasic amino-acids can also be used as compactipg agents (see, e.g., U et al.. 
Biochemistry, 12:1763-1772, 1973; and Weiskxypf and Li, Biopolymers 16:669-684, 
1977). Similarly, other polycationic polymers such as polyamines, for exan^le 

10 spermine and spermidine, and cationic lipid-containing polymers can also be used to 
condense and/or charge neutralize nucleic acid (see, e.g., Feuerstein et al., J. Cell. 
Biochem., 46:37-47, 1991; and Behr, BioconJ. Chem., 3:382, 1994). Retroviral 
nucleocapsid proteins can fulfill a similar role (see, e.g., Gelfaod et al. J. Biol. CSiem., 
268:18430-18436, 1993). 

13 Alternatively, compacting agents can be incorporated as an additional 

component of the GDFP. Also, some sequence specific binding proteins, such as 
GAL4, which exhibit a range of binding affinities to different cognate nucleic acid 
sequences may also be used in this capaci^, and in diis regard would function as an 
NBD with bodi nucleic acid binding and conqiaction properties. 

20 Compacting agents might also be incorporated as mediators of indirect binding 

between the tNA and the NBD domam of the GDFP (for exanq^le, the NBD domain 
can be bound to the compacting agent and the compacting agent bound to the tNA). 

A^sanMYPfcprPs 

23 Preferabty, the GDFP is prepared as a single polypeptide fhsion protein 

generated by recoinbinant DNA methodology. To goierate such a GDFP, s e q ue n oes 
encoding the desired components of the GDFP are assembled and fiagments Ugated into 
an e^qiression vector. Sequences encodiog the various Gom|xnients may be asseinbU 
from odier vectors encoding the desired protein seq u ence, from PCR-generaied 

30 fiagments using cellular or viral nucleic acid as ten^late nucleic acid, or by assembly 
of synflietic oligonucleotides encoding die desired sequraoe. However, all nndeic acid 
sequences encoding such a preferred GDFP should preferably be assenibled by in-frame 
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fusions of coding sequences. Flexons, described above, can be included between 
various components and domains in order to enhance the ability of the individual 
components to adopt configurations relatively indqpendently of each other. 

Althou^ a Type-I GDFP is preferably assembled and e>q>ressed as a single 
5 polypeptide chain, one or more of its domains or components may be produced as a 
separate chain that is subsequendy linked to the GDFP by, e.g., disulfide bonds, or 
chemical conjugation. It is also feasible to prqiare conq>lexes in which domains such 
as the NBD and the ODD or their componsm are pl^icaUy associated by other than 
reccmibinant means, either directly or indirecdy, for example, by virtue of non-covalent 

10 interactions, or via co-localizaiion on a proteinaceous or lipid surfiioe. 

Ihe GDFP may be es^ressed ehher in vitro, or in a prokaryotic or eukaryotic 
host cell, and can be purified to the extent necessary. An alternative to the ei^ression 
of GDFPs in a host cell is syndiesis in vitro. This may be advantageous in 
circumstances in. which high levels of e3q>ression of a GDFP might interfere with the 

15 host cell's metabolism; and can be accomplished using any of a variety of cell-free 
transcr^tion/translation systaoQS that are known in die art GDFPs can also be 
prepared synthetically. It will likely be desirable for the GDFP to possess a conqxnxmt 
or seqmmoe that can fiicilitate the detection and/or purification of die GDFP. SiKh a 
component may be die same as or different from one of the various oonspoDcatB 

20 described above. 

tAzny iqyproaches of expressing and purifyir^ recombinant proteins are known 
to diose skilled in die art, and kits for recombinant protein expression and purification 
are available from several commercial manufacturers of molecular biology products. 
Typically, an increased level of puri^ of the GDFP will be desirable. However, 

25 because of the specifici^ of the GDFP for nucleic acid binding, die degree of 

purification need not necessarily be extensive. The GDFPs of the present invention 
may be sterilized by simple filtration through a 0.22 or 0.4Su filter so as to avoid 
microbial contamination of the target cells. 

Since the domains of the GDFP can be assembled in modiilaf fashion in an 

30 expression vector, its construction by recombinant DNA methodology allows the GDD 
to consist of one or many components. Such components may have conylememing 
activi^ ill mediating or enhancing^ne deliveiy, or diey may have closely related 
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functions. In essence, the gene delivery domain can be viewed as possessing any 
function that mediates or enhances the efficiency of deliveiy of the tNA boimd to tlie 
GDFP. 

S Other Variations of GDFPs 

Other variations will be apparent to those of skill in fhe art. For example, the 
GDFP may itself be nmltimerized. Multimerization may be advantageous to increase 
avidi^ of binding of either the NBD or the GDD. A given tNA molecule may also 
contain nniltiple distinct cognate xecogniticm sequences, binding different Type I 

0 GDFPs with distinct functions, or the tNA may be bound wifli a mixture of l^pe I and 
Type n GDFPs. Additionally, certam conqponents of the GDD, such as IN proteins, 
may require dinoerization for optunal acdvi^. Dhnerization of die GDFP may be 
obtained by inchidix^, for example, a leucine zipper motif in the GDFP. Such motift 
are common in DNA binding proteins and are responsible for dieir dimerization 

S ^ouzarides & 2ff, 1989). Leucine zippers can be inserted into DNA binding proteins 
and cause diem to dimerize (Sellers and Struhl, Nature 341:74-76, 1989). 
Multimerization of GDFPs can also be adiieved, for exanq>le, by creatiiig a 
recombinant fusion protein that contains two or more GDFPs. Preferably such 
multimerized GDFPs ate separated by flexons, as described herein. Other 

0 oligomerization moti£5 from dimeric or multimeric proteins can similarly be employed. 

mnstrations of Tvoe^n Gene PeHverv Fusion Proteins 

Type-n GDFPs do not bind targeted nucleic adds in a sequence-qiecific mannCT 
LS because the nucleic acid bindii^ components of IVpe-II GDFPs are all derived from 
nucleic acid binding proteins that are non-sequence-specific in their binding to nucleic 
acid. 

Nucleic Acid Binding Domains of Tvoe^n GDFPs 
10 The mideic acid binding domains (NBDs) of Type-II GDFPs cono^nise bnidmg 

components that are derived firam non-sequenoe-spedfic nucleic acid bindii^ proteins, 
recombinantly fused to a eeoc delivery domain (GDD) as described above. 
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A number of non-sequence-specific nucleic acid binding proteins have been 
identified and characterized, including, for example, histones or polypeptides derived 
therefrom (see, e.g., von Holt, Bioassays 3:120-124, 1986; Rhodes, Nucleic Acid 
Res., 6:1803-1816, 1979; and Rodriguez et al., Biophys. Chem., 39:145-132, 1991); 
3 retroviral nucleocapsid proteins (see, e.g., Gelfond et al. J. Biol. Chem., 268:18430- 
18436, 1993); protems such as nucleolm (Erard et al., Eur.' J. Biochem. 191:19-26, 
1990); avidin (Pardridge & Boado, F.E.B.S. Lett 288:30-32. 1991); and polybasic 
polypeptide sequences sudi as poly-L-lysine (Li et al.. Biochemistry, 12:1763-1772 
1973; Weiskopf and U, Biopolymers 16:669-684, 1977). 

10 For the reasons discussed herem, aU of die GDFPs of the present invention are 

preferably produced as recombinant fusion proteins. However, the recombinant 
expression, in a host cell, of non-sequence-specific nucleic acid binding c o mp o n en ts in 
Type-n GDFPs (as well as in Type-I GDFPs that incorporate sequence-mm-qiecific. 
nucleic acid binding conqioneiits) may be hindered by intbrferenoe of the e^qpressed 

IS proteins with host ceU nucleic acids. In such situations, the GDFPs can be readity 

syndiesized in vitro using aiqr of a variety of cell-ficee transcription/translation ^sterns 
that are known in the art 

Gene Delivery nnmflfng nf TVne-n GDFPs 
20 The various possible somoes of components making up die gene delivery 

domains of Type-U GDFPs are essentially die same as described above for Type-I 
GDFPs (although, by definition, Typ^n GDFPs would not include sequence-specific 
binding components such as the sequence-specific integrase components described 
above for T>pe-I GDFPs). 
■23 . 

Targeted Nucleic Acids for Use with Tvoe-H GDFPs 

The targeted nucleic acids to be combined widi Type-II GDFPs are as described 
aboye except that they need not contain specific recognition, sequences since die Type-II 
GDFPs bind nucleic acids via noii-specifiq hueractions. 



30 
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Assembly of TvpcH GDFPs 

The assembly of Type-II GDFPs is preferably via the synthesis of lecombinant 
fusion proteins (see the description above regarding assembly of Type-I GDFPs). 

5 Using GDFPs of the Present Invemion 

Thus, the GDFPs of the present invention can be used for in vitro or in vivo 
gene deliveiy. For therapeutic applications, taiget cells can be transduced ex vivo and 
returned to a patient, or, given the biochemical nature of the tNA/GDFP conq>lex, cells 
can be treated directly in vivo. For such m vivo therapy, die complexes can be 

10 formulated for a varieiy of modes of administration, inchidinig systemic and topical or 
localized administration. Techniques and formulations may be found, for example, in 
Remington's Pharmaceutical Sciences. Mack Publishing Co., Easton, PA. (latest 
edition). The tNA/GDFP complex may be combined with a carrier such as a diluoit or 
exctpient which may include, for exanq>le, fillers, extenders, wetdng agents, 

IS disintegrants, surface-active agents, or lubricants, depending on the nature of the mode 
of administration and the dosage forms. The nature of the mode of administration will 
depend, for example, on die location of the desired taig^ cells. For in vivo 
administration, injection is preferred, including intramuscular, intratnmoral, 
intravenous, intra-arterial (including delivery by use of double balloon cadietera), 

20 intraperitoneal, and subcutaneous. Delivery to lung tissue can be accomplished by, 
e.g., aerosolization. For injecdon, the complexes of the invention are formulated in 
liquid solutions, preferably in physiologically compatible buffers such as Hank's 
solution or Ringer's solution. In addition, die conq>lexes may be formulated in solid 
form and redissolved or suspended immediately prior to use. Lyophilized fbnns are 

2S also included. Systemic administration can also be by transmuoosal or transdennal 

means, or tibe con^xiunds can be administered orally. For transmucosal or transdennal 
administratipn, penetrants s^ropriaie to die barrier to be permeated are used in the 
formulation. For topical administration, the complexes of die invention taay be 
formulated into ointments, salves, gels or creams, as is generally known in die art. 

30 The GDFP qiproach can dms be used to target any cell, in vitro, ex vivo or in 

vivo, the only requirement being that die target cells have binding sites for the GDFP 
on then: surface. The present invention wiU thus be useftd for many gene dierapy 
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applicadons. As an illustrative example, the target cells that could be used in die 
context of the present invention include lymphohematopoietic cells. These include: (i) 
stem cdls, which have many potential applications in gene therapy, including 
correction of hereditary disorders such as Gaucher disease and hemoglobinopathies, as 
5 well as genetic modification with intracellular vaccines against HIV such as decoys or 
dominant-negative proteins; and (ii) lymphocytes, which would allow genetic 
modification of effector T cells such as CTLs for use in human therapy ^th genes of 
interest such as suicide genes and regulated promoter cassettes. Also included for use 
in die context of die present mvention are cells of the cardiovascular system which line 

10 blood vessels including endothelial cells and vascular smooth muscle cells, which could 
be geneticaUy modified to inhibit adierosclerosis or restenosis foUowiqg aogic^lasty. 
Similarly, die present invention could be used to introduce genes into airway q>idieUal 
cells, such as die CFTR gene to correct cystic fibrosis. The present invention could 
also be used to transduce tumor cells and thereby genetically modify them to express 

15 suicide genes for tumor elimination or produce ^toldnes or express immunosrimulatmy 
molecules for use as a tumor vaccine in cancer patients. Anodm illustrative 
applicaticm of die present invention is deliveiy of DNA or RNA to antigen presenting 
cells (APCs). This could be usefld, for exan^ile, to allow es^ression of specific (OfA- 
encoded) antigens by an AFC, thereby allowing the AFC to stimulate an antigen- 

20 q>ecific miwwwft response, such as a CTL response. SucSi an approach can be used in 
vitro, l>y transduction of APCs with a GDFP/tNA conq)lex therdiy allowing antigen 
presentation for die stimulation and generation of CTLs in vitro, or in vivo deliveiy 
can be used, to allow such antigen presentation in vivo. Direct deliveiy of RNA to 
APCs using die present invention may be especially desirable for situations in winch 

25 antigens are encoded by transcr^ that require special conditions for intraceltnlar 
transport or processix^ that may not hqipen efficiendy in the APC. An illustrative 
exanqile would be rev-dependjent RNAs of HIV (such as HIV gag). Transductioii of 
APCs widi RNA in the context of the present invention can thus be used, for example, 
to circumvoit the need for iniclear export of rev-dqiendent RNAs. Additionally, the 

30 present invention could be used to introduce genes into hepatocytes of die liver to 
correct genetic defiBCts such as familial hypercholesterolemia, hemophilia and odier 
metabolic disoidera, or to produce recombinant products for systemic deliveiy. 
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Similarly, fibroblasts or connective tissue cells could be modified to secrete qrtokines 
or soluble enzymes for immunomodulatory purposes or to correct a metabolic 
deficiency. These tissue targets and diseases, together widi others are more fully 
described in Scriver et al.. Eds., 'The Metabolic Basis of Inherited Disease*. 6tfa Ed.» 
5 McGraw-Hill. 1989. and in MiUer, A.D.. Blood 76:271-278. 1990. The present 

invention is particularly useful in cases in which genes of fasterest cannot be transferred 
by commonly used viral vectors, or in which the target cells are not infectable by vital 
approaches (see, eg., Israel & Kaufinan, Blood, 73:1074-1080, 1990; Shimotohno & 
Temin, Nature 299,263-268, 1982; Stead et al.. Blood, 71:742-747. 1988; and Bodine 
10 ' et al.. Blood, 82:1973-1980, 1993). 

The GDFP abroach of the present invention can be used as a generically useful 
method for gene transduction of cells, and could be provided as a laboratory idt fbr 
gene transduction for use witii, e.g., insect, a^iian, mammalian, or odm lugber 
eukaryotic cells.. 

13 The transfer of genes in the present invention can also be facilitated by other 

biochenucals known to enhance the uptake of nucleic acid by cells (see, e.g.. Kawai & 
Nishizawa. Mol. CeU. Biol. 4:1172-1174, 1984; Behr etal., P.N.A.S. 86:6982-6986, 
1989; Rose et al., P.N.A.S. Biotechniques 10:320-323, 1991; Pardridge & Boado, 
F.E.B.S. Lett. 288:30-32, 1991; Legendre & Szoka. P.N.A.S. 90:893-897. 1993; 

20 Haensler & Szoka, Bioconj. Chem. 4:372-379. 1993). Tbese and other techniques for 
use in die context of the present invention can be used under conditions (for incubation 
etc.) as described in die art (see. e.g., Kriegler, M. 1990 (ed.), "Gene Transfer and 
Expression, a Laboratory Manual," (1990)). In Ae case of GDFPs comprisiAg pH- 
dq)endent M-D components, such as the TM protein of diphflieria toxin (see, e.g., 

25 Choe et al. (1992) Nature 337:216-222), entry of die GDFP/tNA conq>lex into die oeU 
can be conveniently achieved by sinyly reducing the pH of the incubation medium 
during transductioiL 

Tbc examples presented below are provided as a fiirfher guide to the practitioner 
of ordinary skill m die art« and are not to be construed as limiting the invention in any 
30 way. 
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Example 1 

Preparation of a Nucleic Acid Binding Domain (NBD) 
From the Yeast GAL4 Protein 
The DNA binding domain of GAL4, amino acids 1-147 (Laughon and 
S Gesteland, Molecular and Cellular Biology 4:260-267, 1984; Ma and Ptashne, Cell 

48:847-853. 1987; and Carey et al., J. Mol. Biol. 209:423-432, 1989), was amplified 
by PCR firom ceievisiae (ATCC 60248) using the following anq>l]mers. 

The amplimcr for the 5* end of GAL4 was as follows: 
5' GCC5C ACTAGT GCCACC ATG AAG CTA CTB TCT TCT ATC G y. 
10 The GAL4 coding region is underlined. Tliis an^ilimer created a Spel site 

(ACTAGT) for cloning into pBtuescrq)t (Stratagene) which allowed for subsequent 
transcription by T3 RNA polymerase. The anqpliun* also included a oonsmsus 
sequence (GCCACQ for efficient protein translation located upstream of the initiator 
methionine (Kozak et al., Nud. Acids Res. 13:3374, 1987). 
IS The amplimer for the 3' end of the OAL4 NH2-tenn]nus (up to amino acid 147) 

was as follows: 

5- GCGC GGTACC TCCCq A TAC ACT C AA CTC TCT TTg ACC 3\ 

The GAL4 codiqg region is underlined. This anqplimer created a 3' A8p718 site 

(GGTACQ far cloning into pBIuescript as noted above. The anopUmer also inchided a 
20 BspEl site (TCCGGA) to allow f or an inrframe fosion with an oligomer encoding a 

flexible p^tide sequence (see bekiw). 

The GAL4 fragment was amplified by 30 cycles of PCR directly from a coloi^ 

of S^cexevis|as. The product was digested widi Spel aiid A^718 md ligated between 

the Spel and Asp718 sites located in the poly lihte region of pBh^script. The 
25 construct was transformed into the DHIOB strain of £^ coli by electroporation, and a 

colony contaiiiii^ tte GAM fragment was ideiitified by restriction 

The resulting plasmid, designated pT3gGAL4, is shown in Fig. 2A. 
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Preparation of a Gene Delivery Domain 
From the Human IL-2 Protein 
A DNA ftagment encoding mature soluble human IL-2 (amino acids 21-133) 
5 was amplified by PCR from a full-length human ILr2 cDNA (Tanigucfai et al.. Nature 
302:305-310. 1983), using the followixig amplimers. 

The amplimer for the 5' end of mature human IL-2 was as follows: 
5 - GCGC ACTAGT GCCACC ATG GCG CCT ACT TCA APT TCT AC A AAG 
AAAu^3\ 

10 The IL-2 coding region is underlined. This amplimer created a Spel siie 

(ACTAGT) for cloning into pBluescrqyt, and insetted an initiator methionine 
immediately iq)stream of amino acid 21 of IL-2. The anqilimer also contained a 
consensus sequence (CSCCACC) for efficient translation upstream of die inserted 
methionine, as noted above. A Narl site (GGCGCC) was also included wbidb allowed 

IS for a subsequent in-frame fusion with a liiiker sequence vAdsh separated die GAL4 and 
IL-2 domains in die GAL4/IL-2 construct (see below). 

The anqplimer for the 3' end of mature human IL-2 was as follows: 
5' GCGC GGTACC TCA AGT CAQ AGT ACT GAT GAT GCT TTG ACA AAA 
GGT AAT C 3\ 

20 This amplimer created an Asp718 site (GGTACC) for cloning into pBluescript, 

and also retained the wild-type terminadon codon for human IL-2. A Seal site 
(AGTACT) at die 3' end of the IL-2 coding region was also created this amplimer 
widiout introducing amino acid changes. The DNA fragment encoding die mature 
hunian IL-2 protem was amplified 1^ 30 cycles of PCR from the fuU-lengdi human 

25 IL-2 cDNA referred to above. The product was digested with Spel and Aq>718, 
ligated into pBluescript, and transformed into DHIOB cells as described above. A 
colony haifooring an appropriate construct was identified by restriction enzyme analjrsis. 

Sequencing of a plasmid derived from one colony revealed that an alteration 
Goss of a single base resuhiqg in a frame-shift near tihe terminus of n>2) had dccuired 

30 within d» 3* an^lnner during PCR cloning - dierd>y generating an IL-2 nratein. 

Specificalty, die first T after die Sea 1 site (m die first GAT tripte) was removed^ 
causing' a fi:ame-shift that also generated a premature termination codon. As a result. 
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the 5 amino acids nonnally present at the tenninus were replaced by 3 different amino 
acids. This plasmid, referred to as "pT3matIL-2m'' (shown genericaify in Figure 2A as 
pT3matIL-2), was used to create a gene deliveiy fusion protein as described in 
Example 3. Despite the variation in the IL-2 domain, a GDFP based on this IL-2 
S mutein exhibitBd IL-2 bioactivity, as described below. 

A second colony contained a plasmid designated ''pT3matIL-2" <as shown 
genericaUy in Figure 2A) diat contained the expected wild-type ILr2 sequence. 
Plasmid pT3inaaL-2 was used to create two GDFPs as described in Examples 3 and 4. 

10 ' Exampte? 

Construction of Plasmids Encoding a Gene Deliverv 
Fusion Protein rGDFP^ Having a onn ^nd an NBD 
Separated bv a Flexon 
A DN A firagment encodii^ the nudeic add binding domain (NBD) derived fiom 
IS GAL4 was isolated firom pT3gGAL4 (Exanqyle 1) by digesting wiOi Spel and BspBl. 
A DNA fitagment encodiqg die gene delivery domain (GDD) derived finom a human 
TLr2 mutein was isolated from pT3maaLr2m (Example 2) by digestii^ wiA Narl and 
Asp718. The fbllowiic oligomer pair encoding Ae flexon sequence 
(G^lyGIyGlySer), was annealed creatiiig a S' B9EI over-haqg (CCGGA) and a 3* 
20 Nari over-hang (CGOC^: 

S* CCGGA GGC GGT GGA TCC GGTT GCT GGA GGC AC3T GGA GGA GGT GGC 
TCGG 3*; 

5* GGC CGA GGC ACC TOG TCC ACT GGC TCC ACC ACC GGA TCC ACC 
GCCT3*. 

25 The NBD and GDD fragments and die annealed oligomer were ligated into 

pBluescript between die Spel and Asp718 sites, and transfbnned into DHIOB cells as 
described above. A colox^ harboring a construct diat contained all three fragments was 
identified by its ability to hybridize to bodi GAI4 and IL-2 pF]-labded fragments, and 
by restriction eiuyme analysis,. 

30 In the resulting plasmid, designated *pT3GAL4/IL-2m" (shown generically in 

Fig. '2A as pT3GAL4/IL-2), the sequence encoding the GDFP was msected into 
pBluescrqyt in an orientation which allowed for sense RNA transcripts to be synthesized 
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with T3 RNA polymerase. The resulting RNA, when translated, incorporated both the 
DNA binding domain of the yeast GAL4 protein and the mature form of the human 
IL-2 niutein, in that order, separated by a flexible amino acid linker. 

A second plasmid, designated pT3GAL4/IL-2 (as shown in Fig.2A), was 
5 constructed exactly as described for pT3GAL4/IL-2m, except that the DNA firagment 
encodinig die gene delivery domain (ODD) derived from human IL-2 was isolated from 
pT3matIL^2 (Example 2). 

Example 4 

10 Construction of a Third Plasmid Enco dmg a Gene Deliverv 

Fusion Protein (GDFP\ Havfag a GDP and an NBP 
Sffarated Py a nem^ 
Another e3q)ression vector encoding a GDFP derived from 11^2 and GAL4 was 
constructed as follows. 

15 The DNA bmduig domain of GAL4, amino acids 1-147 (Carqr, et al., siqpra), 

was amplified by 30 cycles of PGR from pT3gGAL4 using die following amplimers. 

The an^limer for die S* end of (3AL4 was as foUows: 
5' GCGC GGATCC ATG AAG CTA CTG TCT TCT AXC G 3\ 

This anq)lmier created a BamHl site (GGATCC) immediately upstream of Met^ 
20 to allow for an in-frame fusion widi a flexible pqitide sequence bi front of GAL4 (see 
below). 

The amplimer for the 3* end of the GAL4 NH2-terminus <up to amino acid 
147) was as follows: 

5* (3CGC GGTACC G CTA GCT TAC AGT CAA CTG TCT TTG ACC 3\ 
2S This anq>luner created an Asp718 site (GGTACC) for cloning into pBluesa:q>t 

and also included an engineered termination codon (CTA) at the C-terminus of die 

DNA binding domain of GAL4. 

To construct pT3IL-2/GAL4, the GAL4.PCR product was digested widi BamHl 

and AspTlS. A DNA fragment encoding human ILr2 was isolated from pT3madL-2 
30 (see Example 2) by digesting widi Spel and Seal. The following oligomer pair 

encoding die amino acid sequence (G^lyGlyGlySer)| was annealed, creatiqg a S* 

Seal over-hang (ACT) and a 3* BamHl over-haqg (GATCQ: 
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S' ACT CTG ACT GGA GGT GGG GGC TCT GOT GGC GGA GOT AGT GGA 
GGA GGT G 3*; 

5' GA TCC ACC TCC TCC ACT ACC TCC GCC ACC AGA GCC CCC ACC TCC 
AGT CAG AGT 3'. 

The IL-2 and GAM fragments and oligomers were ligated into pBliiescrq>t 
between the Spel and Asp718 sites and die construct was transformed into the DHIOB 
strain of E^. fioU by electroporation. A colony containing all duee fragments was 
identified by its abili^ to hybridize to bodi GAL4 and IL-2 p^-labeled fragments, 
and by restriction enzyme analysis. 

In die pT3IL-2/GAL4 construct, shown in Figan 2B, the GDFP was inserted 
into pBluescript in an orientation which allowed for a sense RNA to be syndiesized 
with T3 RNA polymerase. The resulting RNA, when translated, incocporated both the 
mature form of human IL-2, and the DNA binding domain of die yeast GAL4 protein, 
in diat order, separated by a flexible amino acid linker. 

E^camoleS 

Expression of Gene Delivery Fusion Protems 
Sense mRNA encodizig the GAL4/ILr2m GDFP construct (described in Exanqile 
3) was transcribed ja vitro widi T3 RNA polymerase from the pT3GAL4/IL-2 vector. 
Brieffy, pT3GAL4/IL-2m plasmid was linearized with Asp718 and diis template was 
combined widi a ribonucleodde mixture (rNTPs), RNA cap structure analog 
(mTGppp), and T3 RNA polymerase in a HEPES-based buffer (Promega *'RiboMAX*). 
After incubation at 37 degrees C, die DNA tenq>late was digested with RNase-fiee 
DNase (Promega), and die synthesized mRNA was separated from unincoiporatBd 
rNTPs by chromatogr^hy through a G2S Sqphadex spin cohnnn (Boehringer 
Mannheim), precipitated widi EtOH, and quantitated by ODm- 

The resultant mRNA. was translated in a cell-free rabbit reticuloq^ lysate 
system. mRNA was added to a translation mixture of reticulocyte lysate, RNasin, and 
c(mq>lete amino acids G^tomega). Translation was allowed to proceed for 1 to 2 hr. itt 
30 degrees C, after which lysates were stored at -70 degrees C. Tbe integrity and 
moleodar weight of the fusion protein was assessed by including pS}-labeled 
metfiibnine (Amersham) in the translation mix, and visualizhig the product by 
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polyaciylamide gel electrophoresis under denaturing conditions. Fig. 3, lane 2, shows 
the [^S]-labeled GAL4/IL-2m translation product as resolved on a 14% acrylamide gel. 
The position of the GAL4/IL-2m GDFP translation product agreed with the predicted 
MW of 33kD. Molecular weight markers are shown in Fig. 3, lane 1, and a negative 
S control is shown in lane 3. 

Sense mRNAs encodiqg the GAL4/ILr2 GDFP construct (described in Exanq[>le 
3) and the IL-2/GAL4 GDFP construa (described in Exanq)le 4) were transcribed m 
vitro with T3 RNA polymerase from the pT3GAL4/IL-2 and pT3IL-2/OAL4 vectors, 
respectively, exacdy as described above. Figure 6, lanes 3 and 4, show the 
10 pS]-labeled JLrVGAIA and GAL4/IL-2 translation products as resolved on a 4-20% 
gradient acrylamide gd. The positions of the IL2/GAL4 and GAL4/IL-2 GKT 
translation products agreed with the predicted MWs of 33.3kD and 33.2lcD, 
respectively. Molecular weight markers are shown in Fig. 6, lane 1, and a lucifeiase 
control is shown in lane 2. 

15 

Seouence-Specific DNA BmHtfip ^i^ty ftf Qr}fff 
The abili^ of the GDFPs of Exanxple S to engage in sequeiKe-specific DNA 
binding was demonstrated by use of an electrqphoretic mobili^ shift assay (EMSA) 
20 (Ausubel et al. (eds), ^'Current Protocols m Molecular Biology," (1987 and 1993)). 
The target oligomer to which the GAL4 protein binds was: 
5* TCGACGGAGTACrTGTCCTCCGC 3' 
3* GCCTCATGACAGGAGGCGAGCT 5\ 
The following target oligomer is not bound by GALA and was used as a 
25 negative control: 

5' TCGACTGAGTACTGTCCTCAGC 3* 
3' GACTCATGACAGGAGTCGAGCT S\ 
The GAL4 target oligomer was end-labeled usiqg [^]-dCTP (Amersbam) and 
Klenow polymerase (New England BioLabs). The labeled oligomer was sqiarated 
30 from unincoxporated nucleotides by chromatograpliy over a 02S ^in column 

(Boehripger Mannhenn), aivl quantified 1^ scmtillation countipg. This oligomer was 
added to reactions contairuqg a HEPES-based bufifer, wbkh included 
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poly(dI-dC) poly(dl-dC) (Phaimacia), ZDCI2, glycerol and BSA (Carey, et al., J. MoL 
Biol. 209:423-432, 1989; Chasman. et al.« Mol. and Cell. Biol. 9:4746-4749, 1989), 
and varyii^ amounts of reticulocyte lysate containiAg either GAL4/IL-2m GDFP or 
JLr2. The reactions were electrophoresed on a 4.5% polyaciylamide/1% glycerol gel 
5 in 0.5x TBE. The gel was fixed in a methanol/acetic acid solution, dried, and analyzed 
on a Molecular Dynamics phosphorimager • The results shown in Figure 4 show 
decreasing amounts of input GAL4/IL-2m fusion protein (lanes 1-3) showiqg qiecific 
interaction of the GAL4/IL-2m GDFP with the labeled taiget oligomer. DNA size 
maricers zppear in lane 4. In lane S, the GAL4/ILp2m GDFP was incubated wiOi 

10 labeled target oligomer, as in lane 1, but excess unlabeled targ^ oligomer was also 

included and conq>eted widi die labeled target oligonm for bindiiig to die GAL4/lL-2m 
GDFP. In lane 6, the GAL4/IL-2m GDFP was mcubated with labeled target oligomer, 
as in lane 1, but excess unlabeled non-bindipg oligomer was included, and showed lack 
of conq)etition for binding of the GAL4/IL-2m GDFP to die labeled target oligomer. 

IS In lane 7, labeled target oligomer was incubated widi lysate containing human IL*2, 
and showed no specific bindiiig of the labeled target oligomer by eidier IL-2 or 
reticulocyte lysate components. The GAL4/IL-2m GDFP thus bound specifically to the 
cognate target sequence recognized by the GAL4 (NBD) domain of the GAL4/ILr2m 
GDFP. 

20 The results in Figure 7 show sequeni^specific bindii^ of the GAL4 protein and 

the ILrTJGAIA and GAL4/ILr2 GDFPs. Target oligomers, binding conditions, 
electrophoresis and gel treatment were exacdy as described above, except that analysis 
was by autoradiogrqihy. The first four lanes contained decreasing amounts of input 
GAL4 protein, as indicated, showing specific interaction of GAL4 widi die labeled 

25 target oligomer. The following lanes contained decreasing amounts of eidier input 
GAL4/IL-2 GDFP or iqnit IL-2/GAL4 GDFP as imiicated in Figure 7. The 
designation "+c" indicates diat the GDFP was incubated with labeled tatg^ oligonoer, 
as in previous lanes, but excess unlabeled target oligomer was also included in die 
reaction. The designation "-f m" indicates that the GDFP was incubated with labeled 

30 target oligomer, as in previous lanes, but excess unlabeled non-bindiog oUgcmier was 
included in the reaction. The unlabeled target oligomer co m p e ted with die labeled 
targfiL oligomer for binding to die GDFP while die non-binding oligomer showed lack 
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of conq)etition, demonstrating specific binding of die GDFP to the GAL4 recognition 
sequence. In the lane designated "IL2," labeled target oligomer was incubated with 
lysate containing human 0^2, and showed no q>ecific binding of (he labeled target 
oligomer by either IL-2 or reticulocyte lysate components. The IL-2/GAL4 and 
5 GAL4/ILr2 GDFPs thus bound specifically to the cognate target sequence recognized 
by the GAL4 (NBD) domain of die GDFPs. 

J&;aippte7 
Cvtokme Bioactivitv of GDFPs 

10 GAL4/IL-2m, GAlA/ILrl, and IL-2/GAL4 ftision proteins from js vitro 

translations (as in Example 5) were assayed for Aehr IL-2 activities using tbc 
well-known CTLL bioassay. Cells were incubated with the GDFP, then pulsed widi 
H-diymidme and incorporation of radioactivity into DNA was used as a measure of 
cellular proliferation, as described Gillis et al., J. Immunol. 120:2027, 1978. 

15 The results from die GAL4/IL*2m GDFP are shown in Figure S. The IL-2 

standard represents 1 ng/ml recombinant human ILr2 whidi was serial]^ diluted 1:3 in 
the bioassay. The GAL4/IL>2m GDFP curve was generated usiQg in vitro translated 
material starting with a 1:10 dilution of lysate. The bioassay shows retention of IL-2 
biological activi^ by the GAL4/ILr2m GDFP. 

20 The results from the GAL4/IL-2 and IL-2/GAL4 GDFPs are shown in Figure 8. 

The IL-2 standard was as described above. The GDFP curves were generated usiqg jfi 
vitro translated material starting wtdi a 1:50 dihition of lysate. The bioassay shows 
retention of IL-2 biological activity by the GAL4/IL-2 and IL-2/GAL4 GDFPs. 

25 Example 8 

Construction and Characterization of GDFPs Co^t^iniT^g t]]y Diohtiieria Toxm 

Transmfcmbrane Region, 
ThA trgngfngfnhrafife (TM) dnmain ftf Diphihesria toxin pmtefai from 
diphtheriae. amino acids 205-378 (Cfaoe* et a!.. Nature 357:216-222, 1992), is Ac 
30 * region responsible for endosomal release of die catalytic domain of die toxm into die 
cytoplasm of uifected cells (Papuu, et al., JBC 268:1567-1574, 1993, Madsbus, JBC 
269:17723-17729, 1994). This region has also been diown to be cqiabte of celhilar 
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membrane insertion in response to a mildly acidic environment (Moskaug, et ai. JBC 
263:2518-2525, 1988, McGiU, et al., ENfBO 8:2843-2848, 1989). To incorporate this 
domain into the pre-existing GDFPs, two DNA fragments encoding the transmembrane 
domain of the Diphtheria toxin protein were amplified by PGR from f , rfiph^i^ yrit^f 
5 genomic DNA. The first fragment, termed "DT," encoded amino acids 205-378 and 
was anqplified using die foUowu^ amplimers. 

The anqplimer for the 5' end was as follows: 
5* GTAGATCTGGTGGAGGTGGCTCCGGAGGAGGT GGATCC GAT TOG GAT 
GTC ATA AGG GAT AA 3' 
10 The amplimer for the 3* end was as follows: 

5 • CTTC AGATCT GGATCC T CCA CCG CCA CTA CCT CCA CCC CCG GGA 
CGA TTA TAC GAA TTA TGA AC 3* 

The toxin TM sequences are underlined. Bofli ampUmers provided BamHl sites 
near the termini for subsequent cloniqg of tlie PCR fragment into BamHl-digested 
15 pT3IL-2/GAI>i (Example 4). 

The second fragment, termed "DAB," encoded amino adds 176-378, and 
provided additional residues at the amino terminus of the transmenibrane region. 
Within die context of die intact toxin protein these additional sequences are involved in 
an enzymatic cleavage stq> which may be necessary for membrane fiisogenic activi^ 
20 (^miiams, et al., JBC 265:20673-20677, 1990, Ariansea, et al.. Biochemistry 

32:83-$K>, 1993). The "DAB" fragment was amplified usimg the following ainplimers: 

The amplimer for the 5* end was as follows: 
5' GCG GGATCC GGT GGC GGA GGA AGTGAT GCG ATG TAT GAG TAT 
ATG GGT C 3' 

25 The amplimer for the 3* end was the same 3* anqplimer used to PCR the above 

described "DT" fragment 

The toxin TM sequences are underlined. The tesultipg "DAB* PCR fragmrat 

was digested wiOi BamHl, and cloned into, die BandSl sites of bodi pT3GAL4yiL-2 

and pT3II/-2/GAL4.(Exan9les 3 and 4). 
30 . Thus, duee triple-domain fusion plasmids were generated ^T3IL-2DTGAL4, 

pT3IL-2DABGAL4, and pT3GAL4DABILr2), each containing a version of the 

diphflieria toxin transmembrane region as die middle domahi. Eadi of the fluee GDFP 
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RNAs was translated and the resulting fusion proteins were assayed for retention of 
IL-2 bioactiviQr and specific nucleic acid binding abiliQ^ (Examples S, 6, and 7). All 
three triple-domain GDFPs were found to retain these activities. 



The yeast transcripdonal activator, GAL4, has specific affinity for several 
closely related 17bp double-stranded DNA sequences, and it has also been shown to 
bind consensus synthetic 17bp target sequences widi a similar affinity as it does 

10 wild-Qfpe sequences (Giniger, et al.. Cell 40:767-774, 1985; Bram and Kornberg, 

P.N.A.S. 82:43-47, 1985). Target vectors were made by ligadng oligomers containing 
a consensus 17bp sequence (Webster, et al.. Cell 52:169-178, 1988; Carey, et al., J. 
Mol. BioL 209:423-432, 1989) inu> the unique Sail she in the backbone of 
pDC302CAT (Mosley et al.. Cell 59:335-348, 1989; and Ovcrell et aL, J. bnm. Metfa. 

15 141:53^, 1991), a plasmid which directs the expression of chloramphenicol aoe^l 
transferase (CAT) in mammalian host cells. The following oligomers were used: 



5 



ExanmleQ 

Construction of a Targeted Nucleic Acid (tNA^ 



5* TCGACGGAGTACTGTCCTCCGC 3* 
3' GCCTCATGACAGGAGGCGAGCT S\ 



20 



25 



30 



The oligomer pair harbored an internal Seal site (TCATOA) which was used to 
screen resulting transformants for presence of the oligomer. In addition, there -were 
Sall-conq)atible over-hangs at both the 5' and 3* ends, only one of ^^lich could 
regenerate the Sail site upon ligation. This feature was incorporated to allow the 
release of oligomer multimers for plasmid characterization. The oligomer pair was 
annealed by boiling equal amounts of each in a moderate salt bufSer, then slow cooling 
the reaction. The annealed oligomers were dien kinased with T4 polynucleotide kinase 
(Boehringer Mannheim) and ligated to pDC302CAT vidiich had been linearized with 
Sail. The constructs were electr opora ted into DHIOB coli and resulting colonies 
were screened for die pr ese n ce of the target oligomer, or muttfaneis Aereof, by 
hybridization to a [^]-labeled tar;^ oligomer probe. Colonies harboring the target 
oligomer were further characterized for number of copies by restriction enzyme 
analysis and sequenciqg. 
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Ability of GDFPs to Bind to IL>2 Recept or-Bearing CTLL 
GAL4/IL-2 and IL-2/GAL4 GDFPs from la vitro translations (Exanq>le 5) were 
further demonstrated to bind CTLL (see Example 7) via the following assay. CTLL 
5 were incubated in IL-2-ftee medium for 2 hours or longer. pS]-labeled GAL4/IL-2, 
ILr2/GAL4, IL-2, and GAL4 (Example S) were incubated with die CTLL for 1 hour at 
4 degrees C in a binding medium containiog 25mg/ml BSA and 2mg/ml Na-azide in 
kPMI-1640 buffered with 2QmM HEPES. The bind^g medfam was adjusted to a final 
pH of 7.2 prior to use. After bindqg, the cells were washed duee times in ice ooM 

10 PBS, and the final cell pellet was resuspended in a Tris buffer containing ISQmM 

NaCl, SmM EDTA, 0*02% Na-azide, and 0.5% Triton X-100 to gently lyx die cells. 
The lysaie was spun briefly, and the supematam was electrophoresed tlmmgli a 4-20% 
gradient polyaciylamide gel. Figure 9 diows Ubded protein present in the CTLL 
lysaie and, therefore, associated with the CTLL. Lane 1 shows a molecular weight 

IS standard. Lane 2 shows the human IL-2 protein as present in die unreacted reticulo^te 
lysaie (Example 5), and lane 3 is die CTLL tysaie after bindiqg to ILr2. Lane 4 shows 
the GAL4/IL-2 GDFP as present m die unreacted reticulocyte fysate, and lane 5 is die 
CTLL lysaie after bmding to die GAL4/ILr2 GDFP. Lane 6 diows die IL-2/G AL4 
GDFP as present in the unreacted reticulocyte lysate, and lane 7 is the CTLL lysate 

20 after bindiqg to the IL-2/GAL4 GDFP. Lane 8 shows GAL4 as present in die 

unreacted reticulocyte lysaie, and lane 9 is die CTLL lysate after binding to GAL4. 
The GAL4/ILp2 and IL-2/GAL4 GDFPs and IL-2 dms bmd specificalty to CTLL ^iule 
GAL4 does not This demonstrates that CTLL-specific bindiqg of the GDFPs is 
mediated by die IL-2 domain and not by the GAL4 domain. 

25 

A^)il|ty of GAlAfJlrZ mi Il^?/<?AL4 OPFPs V> M<4»ft PWiffg ofa T«»tt 
Oligomer to IL>2 Receptor-Bearing CTLL 
GAIAHLrl and IL-2/GAL4 fusion proteins from q vitro translations (as in 
30 Example 5) were bound to [^]-dCTP-CTd*labeled GAL4 target oUg 

in Example 6. The GDFP-tNA conqilex was bound to CTLL, as described in Example 
10; foer 1 hour at 4 degrees C in bmding medhim containiqg TSmgfiaal BSA and 2mg/ml 
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Na-azide in RPNa-1640 buffered with 20mM HEPES. The binding medium was 
adjusted to a final pH of 7.2 prior to use. The cell-bound GDFP-tNA complex was 
separated from free GDFP-tNA by centrifiigation of the binding mixture through a 
phthalate oil layer (Dower, et al., J. Exp. Med. 162:501-515, 1985). Cell-associated 
5 counts were quantified by scintillation counting. Figure 10 shows counts of labeled 

oligomer associated widi CTLL as mediated by the GAL4/ILr2 GDFP, the ILr2/GAL4 
GDFP, GAL4, and a negative control reticulocyte lysate designated "Bg." The bindii^ 
assay demonstrates the abiliQr of both GAL4/IL-2 and IL-2/GAL4 GDFPs to mediate 
binding of Ae oligomer tNA to CTLL. 

10 

Example 12 

Ability of the G AlAflLrl GDFP to Mediate Binding of a Target Plasmid to n^2 

Rffc;q>t(ff-Peaimg CTLL 
The GAL4/ILr2 GDFP (Example 5) was bound to the targ^ plasmid usinig 

15 binding conditions described in Example 6. The plasmid contained eight copies of die 
GAL4 17-mer target oligomer, as described in Example 8. The GDFP-tNA complex 
was bound to CTLL for 1 hour at 4 degrees C in binding medium as described in 
Exan^le 10. CTLL were then washed three times in ice cold PBS, and the final cell 
pellet was resuspeiuled m a Tris buffer containing 150mM NaCl, 5mM EDTA, 0.02% 

20 Na-azide, and 0.5% Triton X-100 to gently lyse the cells. The cell lysate was spun 
briefly, the supernatant was brought to 0.4N NaOH, and die sanq>le was denatured at 
60 degrees C for 1 hour. The san^le was then applied via slot-l>tot onto GeneScreen 
Phis membrane (MEN)- The blot was screened for the presence of the target plasmid 
by hybridization to a pP]-labeled CAT probe. The membrane was washed, and the 

25 signal from cell associated plasmid was quantified by a phoq)horiinager (Molecular 
Dynamics). Figure 11 shows association of plasmid to CTLL mediated by either the 
GAL4/IL-2 GDFP or a ni^ative control reticulocyte lysate designated *Bg." The 
binding assay showed the abili^ of die GAL4/IL-2 GDFP to mediate Undtav of 
plasmid tNA to CTLL. 

30 

Utility The gene delivery fusion tmtems of die present invention are useful m 
creatipg non-viral gene delivery systems fbr deliveriog a potynudeotide to it target cdl. 
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L A fusion protein useful in delivering a targeted nucleic acid to a target cell, 
5 comprisiiig a gene delivery fusion protein (GDFF), said GDFP comprising a nucleic 
acid binding domain (NBD) that binds to the targeted nucleic acid, fused to a gene 
delivery domain (ODD) Oat mediates delivery of the targeted nucleic acid to the targ^ 
ceU. 

10 ' 2. A fiision protein according to claim 1, wherein die targeted nucleic add is a 

double-stranded nucleic acid. 

3. A fiision protein according to claim 1, wherein die targeted nucleic acid is a 
single-stranded nucleic add. 

15 

4. A fusion protein according to daim 1, wherein die targeted nucleic acid is 
DNA or an analog dieieof . 

5. A fusion protein according to daim 1, wherein the targ^ed mideic acid is 
20 RNA or an analog thereof. 

6. A fusion protein according to claim 1, wherein the targeted nucleic acid is in 
the form of a recombinant expression vector comprisinig a nudeotide sequence to be 
expressed in die target cdl. 

•25 • ■ 

7. A fusion protein according to claim 6, whoein die nucleotide sequence to be 
e3q>ressed is a nucleotide sequence diat is not ixnnially esq;)^^ 

8. A fiision protein according to claim 6, wherein die nudeotide sequence to be 
30 expressed is an <>ntiiBgfwg coi^ of a nudeotide sequence present in the target cell. 
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9. A fusion protein accordiqg to claim 1, wherein said GDFP further comprises 
a flexible polypeptide linker sequence ("flexon") between said nucleic acid binding 
domain and said gene delivery domain or within one of said domains. 

10. A fusion protein according to claim 1, wherein said NBD comprises a 
nucleic acid binding component of a sequence-q>ecific nucleic acid binding protein. 

11. A fusion protein according to claim 1, wherein said NBD conqnrises a 
nucleic acid binding conqponent of a sequence-non-specific nuclei acid binding proteiiL 

12. A fusion protein according to claim 1, wherein said NBD conqirises a 
multiplici^ of nucleic acid binding (NB) conqx>nents that bind one or more targeted 
nucleic adds. 

13. A fusion protein according to claim 12, i;^ierein said NBD comprises at 
least two NB oonqmnents having diffiering binding specificities. 

14. A fusion protein according to claim 12, vrtierein die NBD conqnises a first 
NB conqx>nent enable of binding to a specific cognate recognition iwpigiiPft pr e sent in 
the targeted nucleic acid and a second NB conqionent capable of binding non- 
specificaUly to the targeted nucleic acid. 

15. A fusion protein according to claim 1, wherein die NBD further conqnises 
a component capable of mediafiT^ condensation and/or charge neutralization of the 
targmed nucleic acid. 

16. A fusion protein according to claim 1» wherein said gene delivery domain 
(GDD) comprises one or more components Oat fiacilitate delivery of a targeted nucleic 
acid to a target ceU. 

17. A fusion protein according to claim 16, wherein said components diat 
facilitate delivery of a targeted nucleic acid to a target ceU are selected £rom die group 
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consistiiig of a binding/targeting conq>oi]eiit« a membrane-disnipting coiiq)oiieiit, a 
transport/localization component and a leplicon integration component. 

18. A fusion protein according to claim 16, wherein said GDD comprises two 
5 or more conq>onents that facilitate delivery of a targeted nucleic acid to a target ceil, 

said components selected from the group consistiiv of a bindii^targetiqg conqKment, a 
membrane-disruptiqg component, a tranqx>rt/localization conqKment and a rqilioon 
integration conqx>nent. 

10 19. A fusion protein according to claim 16, wherein said GDD comprises a 

binding/targetiQg component* 

20. A fusion protein according to daim 16, wherein said GDD comprises a 
menibnme disrupting component. 

IS 

21. A fusion protein accordfaqg to daim 16, wherein said GDD comprises a 
transport/localization componenL 

22. A fusion protein accordii^ to claim 16, wherein said GDD conqirises a 
20 replicon integration ccnnponcoL 

23. A fusion protein according to claim 22, wherein said rq)licon integration 
campaoast is an integrase enzyme or a derivative diercof diat retains integrase activity. 

25 24. A macrbmolecular conq)lex useful in deliveriqg a targeted nucleic aad to a 

target cell, cbnqnrising a gene delivery fusion protein (GDFP) of claim 1 in association 
with a targeted . nucleic acid. 

25. A macromotecttlar conq>lex acoonUsig to cla^ 
30 cooQirises a replicon integration conqxxnent. 
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26. A macromolecular coaq)Iex according to claim 2S, wherein said replicon 
integration component comprises a recombinase emyme or a derivative thereof that 
retains recombinase activity, and wherein the targeted nucleic acid comprises NBD 
cognate recognition sequences in proximiQr to terminal recombinase recognition 

5 sequences. 

27. A macromolecular conqslex according to claim 26, wherein said 
recombinase is an integrase enzyme or a derivative Aereof that retains integrase • 
activity, and wherein the targeted nucleic acid comprises NBD cognate recognition 

10 sequences in proximiQr to tenninal integrase recognition sequences. 

28. A recombinant polynucleotide useful for pi ep atii g a gene delivery fusion 
protein, said polynucleotide conq>risi]« a coding sequence that encodes a GDFP of 
claim 1. 

IS 

29. The recombinant polynucleotide of clahn 28, whmin said polynucleotide is 
in the form of an expression vector conqnising a transcriptional control region qperably 
linked to said coding sequence, 

20 30. A cell useful in preparing a gene delivery fusion protein, said cell 

containing an esqxression vector of claim 29. 

31. A m^od of using a recombinant polynucleotide of claim 29 to produce a 
GDFP, said method comprising tiie steps of Ci^iifring tbe recombinant polynucleotide to 

25 be transcribed and translated, and recovering a GDFP. 

32. A metiiod of usng a GDFP of claim 1 to deliver said targeted nucleic acid 
to a target cell, the method comprising die steps of contacting die GDFP widi tiiB 
targeted nucleic acid to produce a GDFP/nudeic acid oonq>lex and contacting said 

30 GDFP/nudeic ac»l complex widi die target cell. 



33. AceUprodncedl^tfaeinethodof claim 32 and prp^igf dam 
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34. The cell of claim 33 wherein the taigeted nucleic acid is expressed in die 
cell as an RNA molecule selected ftom the group consisting of an RNA transcrqit, an 
antisense RNA, an RNA decoy and a ribozyme. 

S 35. A method of using a GDFP of claim 23 to deliver said taigeted nucleic 

acid to a target cell, the method con^risipg Ae stqps of contactiqg the GDFP with the 
targeted nucleic acid to produce a GDFP/nucleic acid complex and contactiqg said 
GDFP/nucleic acid con^^toi with the taiget cell. 

10 36. A cell produced by the mediod of claim 35 and progeny thereof, said^ 

conqnising an integrated cc^ of said targeted nucleic acid. 
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